TagSoup is a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: poor, nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design. By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML. TagSoup also includes a command-line processor that reads HTML files and can generate either clean HTML or well-formed XML that is a close approximation to XHTML.
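Because TagSoup exposes the standard SAX interface, handler code written against `org.xml.sax` does not need to know which parser feeds it. The sketch below illustrates this with a hypothetical `LinkCollector` handler that gathers `href` values from `<a>` elements. To stay runnable without extra dependencies, the demo uses the JDK's built-in XML reader on well-formed input. With the TagSoup jar on the classpath, the assumption is that you would pass `new org.ccil.cowan.tagsoup.Parser()` as the `XMLReader` instead, so that messy real-world HTML can be handled the same way.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical example class (not part of TagSoup): collects link targets via SAX events.
public class LinkCollector extends DefaultHandler {
    private final List<String> links = new ArrayList<>();

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        // Readers that are not namespace-aware may report an empty local name,
        // so fall back to the qualified name.
        String name = (localName == null || localName.isEmpty()) ? qName : localName;
        if ("a".equalsIgnoreCase(name)) {
            String href = atts.getValue("href");
            if (href != null) {
                links.add(href);
            }
        }
    }

    public List<String> getLinks() {
        return links;
    }

    // Works with any SAX XMLReader; the parser implementation is supplied by the caller.
    public static List<String> collect(XMLReader reader, String markup) throws Exception {
        LinkCollector handler = new LinkCollector();
        reader.setContentHandler(handler);
        reader.parse(new InputSource(new StringReader(markup)));
        return handler.getLinks();
    }

    public static void main(String[] args) throws Exception {
        // JDK reader used here for a dependency-free demo.
        // Assumption: with TagSoup available, use
        //   XMLReader reader = new org.ccil.cowan.tagsoup.Parser();
        XMLReader reader = SAXParserFactory.newInstance().newSAXParser().getXMLReader();
        String markup = "<html><body><a href=\"a.html\">A</a>"
                + "<p><a href=\"b.html\">B</a></p></body></html>";
        for (String link : collect(reader, markup)) {
            System.out.println(link);
        }
    }
}
```

The JDK reader rejects input that is not well-formed XML, which is exactly the case TagSoup is meant to cover. Swapping the reader is the only change the handler code should need.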
org.ccil.cowan.tagsoup:tagsoup is a tool in the Languages category of a tech stack.
What are some alternatives to org.ccil.cowan.tagsoup:tagsoup?
JUnit is a unit testing framework for Java, created by Erich Gamma and Kent Beck.
spring-boot-starter-web is a starter for building web applications, including RESTful ones, using Spring MVC. It uses Tomcat as the default embedded container.
spring-boot-starter-test is a starter for testing Spring Boot applications with libraries including JUnit, Hamcrest and Mockito.
Apache Commons Lang is a package of Java utility classes for the classes that are in java.lang's hierarchy, or that are considered so standard as to justify existence in java.lang.