<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Datasets &#8211; Web Scraping Service</title>
	<atom:link href="https://webrobots.io/category/datasets/feed/" rel="self" type="application/rss+xml" />
	<link>https://webrobots.io</link>
	<description>We do web scraping service better!</description>
	<lastBuildDate>Tue, 28 Apr 2020 11:02:25 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.5.8</generator>
	<item>
		<title>Instant Data Users Group on Facebook</title>
		<link>https://webrobots.io/instant-data-users-group-on-facebook/</link>
					<comments>https://webrobots.io/instant-data-users-group-on-facebook/#respond</comments>
		
		<dc:creator><![CDATA[nicerobot]]></dc:creator>
		<pubDate>Tue, 28 Apr 2020 11:02:25 +0000</pubDate>
				<category><![CDATA[Datasets]]></category>
		<category><![CDATA[Web Scraping]]></category>
		<guid isPermaLink="false">https://webrobots.io/?p=6110</guid>

					<description><![CDATA[We have launched a Facebook group where Instant Data Scraper users will be able to find support for the extension which currently has 65k users. This extension is wildly popular, but at the same time it is completely free, hence Web Robots has limited capacity to answer questions arising from users. We hope that [...]]]></description>
										<content:encoded><![CDATA[<div class="fusion-fullwidth fullwidth-box fusion-builder-row-1 nonhundred-percent-fullwidth non-hundred-percent-height-scrolling"  style='background-color: #ffffff;background-position: center center;background-repeat: no-repeat;padding-top:0px;padding-right:0px;padding-bottom:0px;padding-left:0px;border-top-width:0px;border-bottom-width:0px;border-color:#eae9e9;border-top-style:solid;border-bottom-style:solid;'><div class="fusion-builder-row fusion-row "><div  class="fusion-layout-column fusion_builder_column fusion_builder_column_1_1 fusion-builder-column-0 fusion-one-full fusion-column-first fusion-column-last 1_1"  style='margin-top:0px;margin-bottom:20px;'><div class="fusion-column-wrapper" style="padding: 0px 0px 0px 0px;background-position:left top;background-repeat:no-repeat;-webkit-background-size:cover;-moz-background-size:cover;-o-background-size:cover;background-size:cover;"   data-bg-url=""><div class="fusion-text"><p>We have launched a <a href="https://www.facebook.com/groups/instantdata/">Facebook group</a> where <a href="https://chrome.google.com/webstore/detail/instant-data-scraper/ofaokhiedipichpaobibbnahnkdoiiah">Instant Data Scraper</a> users will be able to find support for the extension which currently has 65k users. This extension is wildly popular, but at the same time it is completely free, hence Web Robots has limited capacity to answer questions arising from users.</p>
<p>We hope that new Facebook group will grow into a community where users can support each other.</p>
</div><div class="imageframe-align-center"><div class="fusion-image-frame-bottomshadow image-frame-shadow-1"><style>.fusion-image-frame-bottomshadow.image-frame-shadow-1{display:inline-block}.element-bottomshadow.imageframe-1:before, .element-bottomshadow.imageframe-1:after{-webkit-box-shadow: 0 17px 10px rgba(0,0,0,0.4);box-shadow: 0 17px 10px rgba(0,0,0,0.4);}</style><span class="fusion-imageframe imageframe-bottomshadow imageframe-1 element-bottomshadow hover-type-none"><a class="fusion-no-lightbox" href="https://www.facebook.com/groups/instantdata/" target="_blank" aria-label="Community Support Group" rel="noopener noreferrer"><img fetchpriority="high" decoding="async" src="https://webrobots.io/wp-content/uploads/2020/04/unnamed.png" data-orig-src="https://webrobots.io/wp-content/uploads/2020/04/unnamed.png" width="500" height="228" alt="Community Support Group" class="lazyload img-responsive wp-image-6111" srcset="data:image/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%27500%27%20height%3D%27228%27%20viewBox%3D%270%200%20500%20228%27%3E%3Crect%20width%3D%27500%27%20height%3D%273228%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E" data-srcset="https://webrobots.io/wp-content/uploads/2020/04/unnamed-200x91.png 200w, https://webrobots.io/wp-content/uploads/2020/04/unnamed-400x182.png 400w, https://webrobots.io/wp-content/uploads/2020/04/unnamed.png 500w" data-sizes="auto" data-orig-sizes="(max-width: 800px) 100vw, 500px" /></a></span></div></div><div class="fusion-clearfix"></div></div></div></div></div><style type="text/css">.fusion-fullwidth.fusion-builder-row-1 a:not(.fusion-button):not(.fusion-builder-module-control):not(.fusion-social-network-icon):not(.fb-icon-element):not(.fusion-countdown-link):not(.fusion-rollover-link):not(.fusion-rollover-gallery):not(.fusion-button-bar):not(.add_to_cart_button):not(.show_details_button):not(.product_type_external):not(.fusion-quick-view):not(.fusion-rollover-title-link):not(.fusion-breadcrumb-link) , .fusion-fullwidth.fusion-builder-row-1 a:not(.fusion-button):not(.fusion-builder-module-control):not(.fusion-social-network-icon):not(.fb-icon-element):not(.fusion-countdown-link):not(.fusion-rollover-link):not(.fusion-rollover-gallery):not(.fusion-button-bar):not(.add_to_cart_button):not(.show_details_button):not(.product_type_external):not(.fusion-quick-view):not(.fusion-rollover-title-link):not(.fusion-breadcrumb-link):before, .fusion-fullwidth.fusion-builder-row-1 a:not(.fusion-button):not(.fusion-builder-module-control):not(.fusion-social-network-icon):not(.fb-icon-element):not(.fusion-countdown-link):not(.fusion-rollover-link):not(.fusion-rollover-gallery):not(.fusion-button-bar):not(.add_to_cart_button):not(.show_details_button):not(.product_type_external):not(.fusion-quick-view):not(.fusion-rollover-title-link):not(.fusion-breadcrumb-link):after {color: #03a9f4;}.fusion-fullwidth.fusion-builder-row-1 a:not(.fusion-button):not(.fusion-builder-module-control):not(.fusion-social-network-icon):not(.fb-icon-element):not(.fusion-countdown-link):not(.fusion-rollover-link):not(.fusion-rollover-gallery):not(.fusion-button-bar):not(.add_to_cart_button):not(.show_details_button):not(.product_type_external):not(.fusion-quick-view):not(.fusion-rollover-title-link):not(.fusion-breadcrumb-link):hover, .fusion-fullwidth.fusion-builder-row-1 a:not(.fusion-button):not(.fusion-builder-module-control):not(.fusion-social-network-icon):not(.fb-icon-element):not(.fusion-countdown-link):not(.fusion-rollover-link):not(.fusion-rollover-gallery):not(.fusion-button-bar):not(.add_to_cart_button):not(.show_details_button):not(.product_type_external):not(.fusion-quick-view):not(.fusion-rollover-title-link):not(.fusion-breadcrumb-link):hover:before, .fusion-fullwidth.fusion-builder-row-1 a:not(.fusion-button):not(.fusion-builder-module-control):not(.fusion-social-network-icon):not(.fb-icon-element):not(.fusion-countdown-link):not(.fusion-rollover-link):not(.fusion-rollover-gallery):not(.fusion-button-bar):not(.add_to_cart_button):not(.show_details_button):not(.product_type_external):not(.fusion-quick-view):not(.fusion-rollover-title-link):not(.fusion-breadcrumb-link):hover:after {color: #0074a2;}.fusion-fullwidth.fusion-builder-row-1 .pagination a.inactive:hover, .fusion-fullwidth.fusion-builder-row-1 .fusion-filters .fusion-filter.fusion-active a {border-color: #0074a2;}.fusion-fullwidth.fusion-builder-row-1 .pagination .current {border-color: #0074a2; background-color: #0074a2;}.fusion-fullwidth.fusion-builder-row-1 .fusion-filters .fusion-filter.fusion-active a, .fusion-fullwidth.fusion-builder-row-1 .fusion-date-and-formats .fusion-format-box, .fusion-fullwidth.fusion-builder-row-1 .fusion-popover, .fusion-fullwidth.fusion-builder-row-1 .tooltip-shortcode {color: #0074a2;}#main .fusion-fullwidth.fusion-builder-row-1 .post .blog-shortcode-post-title a:hover {color: #0074a2;}</style>
]]></content:encoded>
					
					<wfw:commentRss>https://webrobots.io/instant-data-users-group-on-facebook/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>New Dataset &#8211; UK LPA Search</title>
		<link>https://webrobots.io/new-dataset-uk-lpa-search/</link>
					<comments>https://webrobots.io/new-dataset-uk-lpa-search/#respond</comments>
		
		<dc:creator><![CDATA[nicerobot]]></dc:creator>
		<pubDate>Wed, 27 Jan 2016 14:33:24 +0000</pubDate>
				<category><![CDATA[Datasets]]></category>
		<category><![CDATA[datasets]]></category>
		<guid isPermaLink="false">https://webrobots.io/?p=5300</guid>

					<description><![CDATA[We are excited to announce UK LPA Search - it is a search engine for all UK's local planning authorities. Until now there was no possibility to search LPA databases from one place. One had to find each LPA's website and search inside it. Considering there are few hundred of them - this would [...]]]></description>
										<content:encoded><![CDATA[<div class="fusion-fullwidth fullwidth-box fusion-builder-row-2 nonhundred-percent-fullwidth non-hundred-percent-height-scrolling"  style='background-color: #ffffff;background-position: center center;background-repeat: no-repeat;padding-top:0px;padding-right:0px;padding-bottom:0px;padding-left:0px;border-top-width:0px;border-bottom-width:0px;border-color:#eae9e9;border-top-style:solid;border-bottom-style:solid;'><div class="fusion-builder-row fusion-row "><div  class="fusion-layout-column fusion_builder_column fusion_builder_column_1_1 fusion-builder-column-1 fusion-one-full fusion-column-first fusion-column-last 1_1"  style='margin-top:0px;margin-bottom:0px;'><div class="fusion-column-wrapper" style="background-position:left top;background-repeat:no-repeat;-webkit-background-size:cover;-moz-background-size:cover;-o-background-size:cover;background-size:cover;"   data-bg-url=""><div class="fusion-text"><p>We are excited to announce <a href="http://lpasearch.co.uk" target="_blank" rel="noopener noreferrer">UK LPA Search</a> &#8211; it is a search engine for all UK&#8217;s local planning authorities. Until now there was no possibility to search LPA databases from one place. One had to find each LPA&#8217;s website and search inside it. Considering there are few hundred of them &#8211; this would not be an easy task for a human. Our robots have no problems indexing all databases and providing them as a single dataset.</p>
<p>A bonus point &#8211; we geocoded all requests and display them on a map. Therefore anyone can see what building permits are being issues around them. Example: <a href="http://lpasearch.co.uk/maps/London" target="_blank" rel="noopener noreferrer">Map of building permits in London</a></p>
</div><div class="fusion-clearfix"></div></div></div></div></div>
]]></content:encoded>
					
					<wfw:commentRss>https://webrobots.io/new-dataset-uk-lpa-search/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>New Kickstarter Dataset</title>
		<link>https://webrobots.io/new-kickstarter-dataset/</link>
					<comments>https://webrobots.io/new-kickstarter-dataset/#comments</comments>
		
		<dc:creator><![CDATA[nicerobot]]></dc:creator>
		<pubDate>Thu, 31 Dec 2015 08:16:20 +0000</pubDate>
				<category><![CDATA[Datasets]]></category>
		<category><![CDATA[kickstarter datasets]]></category>
		<category><![CDATA[web scraping]]></category>
		<guid isPermaLink="false">https://webrobots.io/?p=5289</guid>

					<description><![CDATA[Recently we updated our Kickstarter robot to crawl project subcategories. This allows us to collect a richer dataset, for example on 2015-12-17 run robot collected data about 144,263 projects with a running time only 2 hours! We also started presenting it in the JSON streaming format which is just a line delimited JSON. Previously we [...]]]></description>
										<content:encoded><![CDATA[<div class="fusion-fullwidth fullwidth-box fusion-builder-row-3 nonhundred-percent-fullwidth non-hundred-percent-height-scrolling"  style='background-color: #ffffff;background-position: center center;background-repeat: no-repeat;padding-top:0px;padding-right:0px;padding-bottom:0px;padding-left:0px;border-top-width:0px;border-bottom-width:0px;border-color:#eae9e9;border-top-style:solid;border-bottom-style:solid;'><div class="fusion-builder-row fusion-row "><div  class="fusion-layout-column fusion_builder_column fusion_builder_column_1_1 fusion-builder-column-2 fusion-one-full fusion-column-first fusion-column-last 1_1"  style='margin-top:0px;margin-bottom:0px;'><div class="fusion-column-wrapper" style="padding: 0px 0px 0px 0px;background-position:left top;background-repeat:no-repeat;-webkit-background-size:cover;-moz-background-size:cover;-o-background-size:cover;background-size:cover;"   data-bg-url=""><div class="fusion-text"><p>Recently we updated our Kickstarter robot to crawl project subcategories. This allows us to collect a richer dataset, for example on 2015-12-17 run robot collected data about 144,263 projects with a running time only 2 hours! We also started presenting it in the JSON streaming format which is just a line delimited JSON. Previously we used to stuff all projects into JSON array and the downside of it was that user would have to read the entire large JSON file into memory before any kind of processing starts. with JSON streaming it is possible to read one line at a time.</p>
<p>Data is posted in the usual <a href="https://webrobots.io/kickstarter-datasets/">place</a>.</p>
</div><div class="fusion-clearfix"></div></div></div></div></div>
]]></content:encoded>
					
					<wfw:commentRss>https://webrobots.io/new-kickstarter-dataset/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
			</item>
		<item>
		<title>Fresh Kickstarter Datasets</title>
		<link>https://webrobots.io/fresh-kickstarter-datasets/</link>
					<comments>https://webrobots.io/fresh-kickstarter-datasets/#respond</comments>
		
		<dc:creator><![CDATA[nicerobot]]></dc:creator>
		<pubDate>Thu, 22 Oct 2015 13:57:35 +0000</pubDate>
				<category><![CDATA[Datasets]]></category>
		<category><![CDATA[kickstarter datasets]]></category>
		<guid isPermaLink="false">http://webrobots.io/?p=5270</guid>

					<description><![CDATA[We have been swamped with work and have not updated our Kickstarter dataset page in while. To correct this today we posted new datasets retrieved in June, August and October. They are listed in the usual place: http://webrobots.io/kickstarter-datasets/ Enjoy!]]></description>
										<content:encoded><![CDATA[<div class="fusion-fullwidth fullwidth-box fusion-builder-row-4 nonhundred-percent-fullwidth non-hundred-percent-height-scrolling"  style='background-color: #ffffff;background-position: center center;background-repeat: no-repeat;padding-top:0px;padding-right:0px;padding-bottom:0px;padding-left:0px;border-top-width:0px;border-bottom-width:0px;border-color:#eae9e9;border-top-style:solid;border-bottom-style:solid;'><div class="fusion-builder-row fusion-row "><div  class="fusion-layout-column fusion_builder_column fusion_builder_column_1_1 fusion-builder-column-3 fusion-one-full fusion-column-first fusion-column-last 1_1"  style='margin-top:0px;margin-bottom:0px;'><div class="fusion-column-wrapper" style="padding: 0px 0px 0px 0px;background-position:left top;background-repeat:no-repeat;-webkit-background-size:cover;-moz-background-size:cover;-o-background-size:cover;background-size:cover;"   data-bg-url=""><div class="fusion-text"><p>We have been swamped with work and have not updated our Kickstarter dataset page in while. To correct this today we posted new datasets retrieved in June, August and October. They are listed in the usual place: <a href="http://webrobots.io/kickstarter-datasets/">http://webrobots.io/kickstarter-datasets/</a></p>
<p>Enjoy!</p>
<p><a href="http://webrobots.io/wp-content/uploads/2015/10/Kickstarter-dataset-screenshot.png"><img decoding="async" class="lazyload aligncenter wp-image-5271 size-full" src="http://webrobots.io/wp-content/uploads/2015/10/Kickstarter-dataset-screenshot.png" data-orig-src="http://webrobots.io/wp-content/uploads/2015/10/Kickstarter-dataset-screenshot.png" alt="Kickstarter dataset screenshot" width="1045" height="427" srcset="data:image/svg+xml,%3Csvg%20xmlns%3D%27http%3A%2F%2Fwww.w3.org%2F2000%2Fsvg%27%20width%3D%271045%27%20height%3D%27427%27%20viewBox%3D%270%200%201045%20427%27%3E%3Crect%20width%3D%271045%27%20height%3D%273427%27%20fill-opacity%3D%220%22%2F%3E%3C%2Fsvg%3E" data-srcset="https://webrobots.io/wp-content/uploads/2015/10/Kickstarter-dataset-screenshot-300x123.png 300w, https://webrobots.io/wp-content/uploads/2015/10/Kickstarter-dataset-screenshot-669x272.png 669w, https://webrobots.io/wp-content/uploads/2015/10/Kickstarter-dataset-screenshot-1024x418.png 1024w, https://webrobots.io/wp-content/uploads/2015/10/Kickstarter-dataset-screenshot.png 1045w" data-sizes="auto" data-orig-sizes="(max-width: 1045px) 100vw, 1045px" /></a></p>
</div><div class="fusion-clearfix"></div></div></div></div></div>
]]></content:encoded>
					
					<wfw:commentRss>https://webrobots.io/fresh-kickstarter-datasets/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/

Page Caching using Disk: Enhanced 
Minified using Disk
Database Caching 35/68 queries in 0.040 seconds using Disk

Served from: webrobots.io @ 2026-04-26 03:05:09 by W3 Total Cache
-->