HtmlUnit

From Seo Wiki - Search Engine Optimization and Programming Languages
Jump to navigationJump to search
HtmlUnit
File:Htmlunit logo.png
Developer(s) Gargoyle Software
Initial release 22, 2002 (2002-05-22)
Stable release 2.6 / September 2, 2009; 474960866 ago
Written in Java
Operating system Cross-platform (JVM)
Available in English
Type Web browser
License Apache License 2.0
Website htmlunit.sourceforge.net

HtmlUnit is a pure Java headless web browser, which allows high-level manipulation of web pages, such as filling forms, clicking hyperlinks, accessing attributes and values of specific elements within the pages. You do not have to create lower-level requests of TCP/IP or HTTP, but just getPage(url), find a hyperlink, click() and you have all the HTML, JavaScript, and AJAX are automatically processed.

The most common use of HtmlUnit is test automation of web pages (even with complex JavaScript libraries, for instance Google Web Toolkit 1.5 tests now pass), but sometimes it can be used for web scraping, or downloading website content.

Version 2.0 includes many new enhancements such as W3C DOM implementation, Java 5 features, better XPath support, and improved handling for incorrect HTML, in addition to the usual JavaScript various enhancements, while version 2.1 mainly focuses on tuning some performance issues reported by users.

External links

If you like SEOmastering Site, you can support it by - BTC: bc1qppjcl3c2cyjazy6lepmrv3fh6ke9mxs7zpfky0 , TRC20 and more...