watir phantomj mechanizeを使わずにJavaScript実行後のを取得 scrape something after JS has changed the DOM
https://gist.github.com/julionc/7476620
http://qiita.com/haman29@github/items/0296e449483a6c1898ea
https://github.com/watir/watir-webdriver/issues/222
phantomjsをubuntu12.04に
$:sudo apt-get install pahntomjs
でインストールすると
$:phantomjs --version
1.4.0
ところが"PhantomJS added WebDriver support in version 1.8.0."だそうなのでremove後
sudo apt-get update
sudo apt-get install build-essential chrpath libssl-dev libxft-dev
Install these packages needed by PhantomJS to work correctly.
sudo apt-get install libfreetype6 libfreetype6-dev
sudo apt-get install libfontconfig1 libfontconfig1-dev
Get it from the PhantomJS(http://phantomjs.org/) website.
cd ~
export PHANTOM_JS="phantomjs-1.9.7-linux-x86_64"
wget https://bitbucket.org/ariya/phantomjs/downloads/$PHANTOM_JS.tar.bz2
Once downloaded, move compress file to /usr/local/share/, and create symlinks:
sudo mv $PHANTOM_JS.tar.bz2 /usr/local/share/
cd /usr/local/share/
sudo tar xvjf $PHANTOM_JS.tar.bz2
sudo ln -sf /usr/local/share/$PHANTOM_JS/bin/phantomjs /usr/local/share/phantomjs
sudo ln -sf /usr/local/share/$PHANTOM_JS/bin/phantomjs /usr/local/bin/phantomjs
sudo ln -sf /usr/local/share/$PHANTOM_JS/bin/phantomjs /usr/bin/phantomjs
なお
gem install phantomjs
も忘れずに!
その後に
b = Watir::Browser.new(:phantomjs)
b.goto URL
doc = Nokogiri::HTML(b.html)
また実際のブラウザーを使うときには
sudo apt-get install xvfb
gem install headless
後
require 'headless'
require 'watir-webdriver'
require 'nokogiri'
vDisplay = Headless.new
vDisplay.start
ff=Watir::Browser.new :firefox, :profile => "default"
ff.goto(targetURL)
body=Nokogiri::HTML(pHTML)
ff.close
vDisplay.destroy