Anyone know how to mine the Pop Sci PDFs here?

OK this is hardly automated but, hey if you want them sometimes you need to do a little work

Go here and download/install this http://www.gbooksdownloader.com/

***NOTE the above program tries to install a bunch of other software at the end of the install package, DECLINE all that additional stuff*** PAY ATTENTION TO THE INSTALL!

Next go to this link http://books.google.com/books/serial/ISSN:01617370?rview=0&lr&sa=N&start=1

That will bring up the most current issues from 2009, and what not...

Right click on each issue and copy URL, paste them into the program you downloaded above, make sure to bump the resolution to the max in the capture software before you save, and also PDF if you don't want a bunch of individual images of the pages...

You can then browse to more issued just like any Google search using the page numbers at the bottom...

BUT DO NOTE, that the Google search page numbers only let you browse to page 100 aka May 1924, from that point you will need to fake the page number...

Page search 100 that gets you to May 1924 url is

Code:

http://books.google.com/books/serial/ISSN:01617370?rview=0&lr=&sa=N&start=990

You need to change the last number up by 10 to obviously get the next 10 issues so change it to

Code:

http://books.google.com/books/serial/ISSN:01617370?rview=0&lr=&sa=N&start=1000

And so on until you get all 1563 issues...

Now, this could be automated with a script, but it really won't take that long to do it manually, ok it will take some time but hardly that much... Also beware that if a script is written it should only download a single page at a time, if Google detects the same IP downloading multiple pages at that same time they will ban the IP as a bot harvester...

Good luck, maybe it can be a joint effort where a few people volunteer to download a decade and share it with each other?

	Welcome, Guest. Please login or register. Did you miss your activation email?	March 22, 2026, 08:19:37 08:19
		Login with username, password and session length

	Author	Topic: Anyone know how to mine the Pop Sci PDFs here? (Read 6263 times)
0 Members and 1 Guest are viewing this topic.