CyberNeko HTML Parser 0.9.5

About

NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces. The parser can scan HTML files and "fix up" many common mistakes that human (and computer) authors make in writing HTML documents. NekoHTML adds missing parent elements; automatically closes elements with optional end tags; and can handle mismatched inline element tags.

NekoHTML is written using the Xerces Native Interface (XNI) that is the foundation of the Xerces2 implementation. This enables you to use the NekoHTML parser with existing XNI tools without modification or rewriting code.

License Agreement

The NekoHTML parser is distributed under an Apache-style license and is currently being considered as a sub-project of the Apache Xerces project. If accepted, the license and development of NekoHTML will move to Apache. However, this will not affect the ability of the parser to be freely used as specified by the current license.

For specific license details, please refer to the LICENSE file.

Download

The NekoHTML parser includes complete Java source code and documentation. You can download the latest version from the following location:

Requirements and Limitations

This version of NekoHTML requires the following:

This version has the following limitations:

For a more complete list of items to be done, please refer to the Todo Items.

Contact Information

Andy Clark <andyc@apache.org>