Phantom.js and casper.js If you can't get the data from the endpoints the javasc...

frabcus · on Dec 9, 2012

I've started playing with zombie.js recently as well - much lighter and faster than the ones that instrument a completely full browser. But has a full Javascript engine.

jonpaul · on Dec 9, 2012

zombie.js is not a full browser. It's a poor emulation using jsdom as its backing. http://zombie.labnotes.org/guts Beware, for some applications, jsdom is super buggy.

freshhawk · on Dec 9, 2012

That's really interesting, thanks.

I worry that it's not going to replicate a real browser accurately enough, but I'm excited to try it out a bit.

jonpaul · on Dec 9, 2012

Your worry is correct. http://news.ycombinator.com/item?id=4896054 I've tried scraping with it, and it failed miserably on some sites.

frabcus · on Dec 10, 2012

Yeah, it's not mature enough yet.

We're also trying it for integration tests, as it is much quicker than Phantom or Selenium. Even there, where we control the standards-compliant site, it isn't quite good enough yet.

Would love to see more people helping make it so, though!

suldan34 · on Dec 9, 2012

upvote for casperjs - it's definitely the best system I've come across for scraping javascript / ajax contents.