mlandauer / fed_mp_data

Scrapes www.aph.gov.au

Home – Parliament of Australia


Contributors richygit

Last run failed with status code 1.

Console output of last run

Injecting configuration and compiling...
-----> Ruby app detected
-----> Compiling Ruby
-----> Using Ruby version: ruby-2.0.0
-----> Installing dependencies using 1.7.12
       Running: bundle install --without development:test --path vendor/bundle --binstubs vendor/bundle/bin -j4 --deployment
       Fetching gem metadata from https://rubygems.org/.........
       Installing Ascii85 1.0.2
       Installing afm 0.2.2
       Installing rake 10.1.0
       Installing hashery 2.1.1
       Installing httpclient 2.6.0.1
       Installing mime-types 2.3
       Installing net-http-digest_auth 1.4
       Installing net-http-persistent 2.9.4
       Installing mini_portile 0.6.0
       Installing ntlm-http 0.1.1
       Installing unf_ext 0.0.6
       Installing webrobots 0.1.1
       Installing ruby-rc4 0.1.5
       Using bundler 1.7.12
       Installing ttfunk 1.3.0
       Installing unf 0.1.4
       Installing pdf-reader 1.3.3
       Installing domain_name 0.5.21
       Installing http-cookie 1.0.2
       Installing sqlite3 1.3.10
       Installing sqlite_magic 0.0.5
       Installing scraperwiki 3.0.2
       Installing nokogiri 1.6.3.1
       Installing mechanize 2.7.3
       Your bundle is complete!
       Gems in the groups development and test were not installed.
       It was installed into ./vendor/bundle
       Post-install message from pdf-reader:
       ********************************************
       v1.0.0 of PDF::Reader introduced a new page-based API. There are extensive
       examples showing how to use it in the README and examples directory.
       For detailed documentation, check the rdocs for the PDF::Reader,
       PDF::Reader::Page and PDF::Reader::ObjectHash classes.
       The old API is marked as deprecated but will continue to work with no
       visible warnings for now.
       ********************************************
       Bundle completed (23.51s)
       Cleaning up the bundler cache.
###### WARNING:
       You have not declared a Ruby version in your Gemfile.
       To set your Ruby version add this line to your Gemfile:
       ruby '2.0.0'
       # See https://devcenter.heroku.com/articles/ruby-versions for more information.
-----> Discovering process types
       Procfile declares types -> scraper
       Default process types for Ruby -> rake, console
Injecting scraper and running...
I, [2015-06-15T10:29:36.836259 #11] INFO -- : Scraping CSV
I, [2015-06-15T10:29:39.465684 #11] INFO -- : Scraping PDF
/app/vendor/bundle/ruby/2.0.0/gems/pdf-reader-1.3.3/lib/pdf/reader/page_layout.rb:17:in `initialize': undefined method `[]' for #<PDF::Reader::Reference:0x007f658d113768 @id=1134, @gen=0> (NoMethodError)
	from /app/vendor/bundle/ruby/2.0.0/gems/pdf-reader-1.3.3/lib/pdf/reader/page_text_receiver.rb:49:in `new'
	from /app/vendor/bundle/ruby/2.0.0/gems/pdf-reader-1.3.3/lib/pdf/reader/page_text_receiver.rb:49:in `content'
	from /app/vendor/bundle/ruby/2.0.0/gems/pdf-reader-1.3.3/lib/pdf/reader/page.rb:76:in `text'
	from /app/pdf_scraper.rb:20:in `block in scrape_pdf'
	from /app/pdf_scraper.rb:19:in `each'
	from /app/pdf_scraper.rb:19:in `scrape_pdf'
	from /app/pdf_scraper.rb:13:in `scrape'
	from /app/scraper_main.rb:48:in `main'
	from scraper.rb:3:in `<main>'
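The trace above shows the run aborting when a call to `text` on one PDF page raises NoMethodError inside pdf-reader. One possible way to keep a single bad page from stopping the whole run is to rescue the error per page and carry on. The following is only a sketch under assumptions: `safe_page_texts` is a hypothetical helper, not code from pdf_scraper.rb, and the page objects are stand-ins so the example runs without the pdf-reader gem. It assumes only that the reader responds to `pages` and each page to `text`, as PDF::Reader and PDF::Reader::Page do.

```ruby
require 'logger'

# Hypothetical helper (not from the scraper's source): collects the text of
# every page, skipping any page whose extraction raises NoMethodError.
# `reader` is assumed to respond to #pages and each page to #text.
def safe_page_texts(reader, logger = Logger.new($stderr))
  texts = []
  reader.pages.each_with_index do |page, index|
    begin
      texts << page.text
    rescue NoMethodError => e
      # Log the failure and move on to the next page.
      logger.warn("Skipping page #{index + 1}: #{e.message}")
    end
  end
  texts
end

# Stand-ins for PDF objects, used only to exercise the helper.
GoodPage = Struct.new(:text)

class BadPage
  def text
    nil[:Font] # raises NoMethodError, mimicking the kind of failure in the log
  end
end

FakeReader = Struct.new(:pages)

reader = FakeReader.new([GoodPage.new("page one"), BadPage.new, GoodPage.new("page three")])
puts safe_page_texts(reader).inspect
```

Skipping a page hides the underlying problem rather than fixing it, so this trades completeness for a run that finishes. Whether that is acceptable depends on how much of the PDF the scraper needs.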

Statistics

Total run time: 1 minute

Total cpu time used: less than 5 seconds

Total disk space used: 112 MB

History

  • Manually ran revision 75bce6df and failed.
    nothing changed in the database
    3 pages scraped
  • Created on morph.io

Scraper code

fed_mp_data