wedata

Database: AutoPagerize

Item Details: Tumblr

url ^.+://[^/]+\.tumblr\.com/
nextLink //a[contains(@href,"/page/")][number(substring-after(self::a/@href,"/page/"))-number(substring-after(preceding::a[contains(@href,"/page/")][1]/@href,"/page/")) = 2 or number(substring-after(self::a/@href,"/page/"))-number(substring-after(following::a[contains(@href,"/page/")][1]/@href,"/page/")) = 2]|//a[starts-with(@href,"/page/") or contains(@href,"/search/") or contains(@href,"/tagged/")][contains(@href,"/page/2")][number(substring-after(following::a[contains(@href,"/page/")][1]/@href,"/page/")) != 4 and number(substring-after(preceding::a[contains(@href,"/page/")][1]/@href,"/page/")) != 4]
pageElement //*[count(child::*[.//a[@class="permalink"] | .//a[child::img[@class="permalink"]] | .//*[@class="date"]/a | .//div[contains(concat(" ",normalize-space(@class)," ")," permalink ")]/a | .//a[contains(@title,"permalink")] | .//a[@class="fecha"] |.//a[contains(@href,"/post/")]])>=4]/*[.//a[@class="permalink"] | .//a[child::img[@class="permalink"]] | .//*[@class="date"]/a | .//div[contains(concat(" ",normalize-space(@class)," ")," permalink ")]/a | .//a[contains(@title,"permalink")] | .//a[@class="fecha"] |.//a[contains(@href,"/post/")]]
exampleUrl http://taizooo.tumblr.com/ http://taizooo.tumblr.com/page/3 http://fjmo.tumblr.com/ http://fjmo.tumblr.com/page/3 http://otsune.tumblr.com/search/neodenjin http://illillill.tumblr.com/page/2 http://cxx.tumblr.com/tagged/reblog/page/2 http://mikirosi.tumblr.com/page/2
comment http://taizooo.tumblr.com/post/65704255/nextlink-2 To give this SITEINFO lower priority than hAtom (http://wedata.net/items/31114) and AutoPagerizeFORMAT (http://wedata.net/items/31158), its url must be at least one character shorter than the hAtom url. To also rank below site-specific SITEINFO entries such as ^http://ku\.tumblr\.com/, the url should be kept as short as possible. The pageElement is based on the paragraph of the LDRize SITEINFO: http://wedata.net/items/28521?rev=41785
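
The comment above implies that entries with longer url patterns are tried first. A minimal Python sketch of that ordering follows, assuming priority is simply the character length of the url pattern. The SITEINFO list, the helper names by_priority and first_match, and the "example" entry are hypothetical; only the Tumblr url pattern is taken from this item.

```python
import re

# Hypothetical in-memory SITEINFO list. Only the "Tumblr" url pattern comes
# from this item; the second entry is an invented site-specific pattern.
SITEINFO = [
    {"name": "Tumblr", "url": r"^.+://[^/]+\.tumblr\.com/"},
    {"name": "example (hypothetical)", "url": r"^http://example\.tumblr\.com/page/"},
]


def by_priority(entries):
    """Order entries so that longer url patterns are tried first (assumed rule)."""
    return sorted(entries, key=lambda entry: len(entry["url"]), reverse=True)


def first_match(page_url, entries):
    """Return the name of the highest-priority entry whose url pattern matches."""
    for entry in by_priority(entries):
        if re.search(entry["url"], page_url):
            return entry["name"]
    return None


if __name__ == "__main__":
    for page_url in (
        "http://taizooo.tumblr.com/page/3",
        "http://example.tumblr.com/page/2",
        "http://example.com/",
    ):
        print(page_url, "->", first_match(page_url, SITEINFO))
```

Under this assumption, a generic pattern only wins when no longer pattern matches the page, which is why the comment asks for the url to stay short.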
