jsoup Java HTML Parser
DeveloperJonathan Hedley
Stable release
1.22.2 / April 20, 2026; 58 days ago (2026-04-20)[1]
Written inJava
Operating systemCross-platform
PlatformJava (JVM)
TypeHTML parser
LicenseMIT license
Websitejsoup.org
Repository

jsoup is an open-source Java library designed to parse, extract, and manipulate data stored in HTML documents.

History

edit

jsoup was created in 2009 by Jonathan Hedley. It is distributed under the MIT License, a permissive free software license similar to the Creative Commons attribution license.

Hedley's avowed intention in writing jsoup was "to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup."

Projects powered by jsoup

edit

jsoup is used in a number of current projects,[2] including Google's OpenRefine data-wrangling tool.

See also

edit

References

edit
  1. ^ "jsoup Java HTML Parser release 1.22.2". Retrieved 2026-04-20.
  2. ^ "Jsoup". MVNRepository / F. Rodriguez. 2015-03-08.
edit

📚 Artikel Terkait di Wikipedia

Java (programming language)

Media Framework Java Topology Suite JAXB JaxP Jetty JFreeChart JProfiler JSoup JUNG JUnit LibGDX LiquiBase LWJGL Netty Neuroph ObjectWeb ASM Oracle WebLogic

Beautiful Soup (HTML parser)

heading in headings: print(heading.text.strip()) Comparison of HTML parsers jsoup Nokogiri https://git.launchpad.net/beautifulsoup/tree/CHANGELOG. {{cite

Comparison of HTML parsers

0 · HtmlUnit/htmlunit". GitHub. "Index of /software/BeautifulSoup/bs4/download/4.12". www.crummy.com. "jsoup release 1.22.2 (2026-Apr-20)". jsoup.org.

Outline of the Java programming language

Media Framework Java Topology Suite JAXB JaxP Jetty JFreeChart JProfiler JSoup JUNG JUnit LibGDX LiquiBase LWJGL Netty Neuroph ObjectWeb ASM Oracle WebLogic

List of Java frameworks

specification for building component-based user interfaces for web applications. JSoup Java HTML parser library. Supports extracting and manipulating data using

OpenRefine

denormalizing. Parsing data from web sites: OpenRefine has a URL fetch feature and jsoup HTML parser and DOM engine. Adding data to dataset by fetching it from web

List of JVM languages

Media Framework Java Topology Suite JAXB JaxP Jetty JFreeChart JProfiler JSoup JUNG JUnit LibGDX LiquiBase LWJGL Netty Neuroph ObjectWeb ASM Oracle WebLogic

List of open-source code libraries

Java Java LGPL-3 JFace Java EPL-2.0 JFugue Java Apache-2 jMusic Java GPL-2 jsoup Java MIT JUnit Java EPL 2.0 LibGDX Java Apache 2.0 Log4j Java Apache 2.0