Recently we updated our Kickstarter robot to crawl project subcategories. This lets us collect a richer dataset: for example, the 2015-12-17 run collected data on 144,263 projects in only 2 hours of running time! We also started publishing the data in the JSON streaming format, which is simply line-delimited JSON. Previously we stuffed all projects into a single JSON array, and the downside was that a user had to read the entire large JSON file into memory before any kind of processing could start. With JSON streaming it is possible to read one line at a time.
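As a rough illustration of reading the file one line at a time, here is a minimal Python sketch. The helper name `iter_projects`, the file name in the usage comment, and the sample records are assumptions made for the example; they are not taken from the actual dataset.

```python
import json
from typing import Any, Dict, Iterable, Iterator


def iter_projects(lines: Iterable[str]) -> Iterator[Dict[str, Any]]:
    """Yield one parsed JSON object per non-empty line.

    Only the current line is held in memory, so the whole file
    never has to be loaded at once.
    """
    for line in lines:
        line = line.strip()
        if not line:
            # Skip blank lines (for example, a trailing newline at end of file).
            continue
        yield json.loads(line)


# Made-up sample lines standing in for a real data file.
sample_lines = [
    '{"id": 1, "name": "first project"}\n',
    '\n',
    '{"id": 2, "name": "second project"}\n',
]

project_count = sum(1 for _ in iter_projects(sample_lines))

# With a real file (hypothetical name), the same helper would be used like this:
#
# with open("kickstarter.json", encoding="utf-8") as f:
#     for project in iter_projects(f):
#         ...  # process one project at a time
```

Because a file object in Python is itself an iterator over lines, passing it to the helper keeps memory use roughly proportional to the largest single line rather than to the whole file.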
Data is posted in the usual place.
I downloaded the most current data set, but I have some difficulties interpreting the variables.
deadline, state_changed_at, created_at, and launched_at have codes and not the normal dates. How can I interpret these?
Also I don’t understand the following variables: disable_communication and staff_pick.
Your help is much appreciated.
Nice dataset, but please add better descriptions for the columns.