We host the application html parser in delphi so that it can be run on our online workstations, either through Wine or directly.


Quick description of html parser in delphi:

THTMLdom is a (Delphi) class with functions that read an HTML source file and dissect it into a tree of THTMLelement nodes. The attributes of the HTML tags are stored in the elements. Functions are provided to select elements by attribute value or by tag name. The structure of the tree can be displayed, and the tree can be rendered as plain text.
The source is plain Delphi Pascal and requires a compiler version that supports TDictionary. There is no dependency on third-party units.
The file to be parsed must use valid HTML4/5 tags, but the HTML does not have to be "correct": end tags may be misplaced or missing altogether. Processing (reading plus parsing) is fast: 15-40 msec per Mbyte, or around 1 msec per 1000 HTML tags.
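
To make the data structure described above more concrete, here is a minimal sketch of an element tree whose nodes keep their attributes in a TDictionary, with selection by tag name and plain-text rendering. It does not use or reproduce the actual THTMLdom or THTMLelement API, and it does not parse HTML. The type TSketchElement and its members (Content, Add, FindByTag, PlainText) are hypothetical names chosen only for this illustration, and the tree is built by hand. The unit names are written without the System. prefix on the assumption that default unit scope names are in effect.

```delphi
program ElementTreeSketch;

{$APPTYPE CONSOLE}

uses
  SysUtils, Generics.Collections;

type
  // Hypothetical node type for illustration; NOT the real THTMLelement.
  TSketchElement = class
  public
    Tag: string;       // tag name; empty for a text node
    Content: string;   // text, used only by text nodes
    Attributes: TDictionary<string, string>;
    Children: TObjectList<TSketchElement>;
    constructor Create(const ATag: string; const AContent: string = '');
    destructor Destroy; override;
    function Add(AChild: TSketchElement): TSketchElement;
    procedure FindByTag(const ATag: string; AResult: TList<TSketchElement>);
    function PlainText: string;
  end;

constructor TSketchElement.Create(const ATag: string; const AContent: string);
begin
  inherited Create;
  Tag := ATag;
  Content := AContent;
  Attributes := TDictionary<string, string>.Create;
  Children := TObjectList<TSketchElement>.Create(True); // list owns the child nodes
end;

destructor TSketchElement.Destroy;
begin
  Children.Free;   // frees all descendants as well
  Attributes.Free;
  inherited;
end;

function TSketchElement.Add(AChild: TSketchElement): TSketchElement;
begin
  Children.Add(AChild);
  Result := AChild;
end;

// Depth-first search; appends every node whose tag matches (case-insensitive).
procedure TSketchElement.FindByTag(const ATag: string; AResult: TList<TSketchElement>);
var
  Child: TSketchElement;
begin
  if SameText(Tag, ATag) then
    AResult.Add(Self);
  for Child in Children do
    Child.FindByTag(ATag, AResult);
end;

// Concatenates the text of this node and all descendants in document order.
function TSketchElement.PlainText: string;
var
  Child: TSketchElement;
begin
  Result := Content;
  for Child in Children do
    Result := Result + Child.PlainText;
end;

var
  Root, Para, Link: TSketchElement;
  Links: TList<TSketchElement>;
begin
  // Hand-built equivalent of: <body><p>See <a href="page.html">this page</a></p></body>
  Root := TSketchElement.Create('body');
  Links := TList<TSketchElement>.Create;
  try
    Para := Root.Add(TSketchElement.Create('p'));
    Para.Add(TSketchElement.Create('', 'See '));
    Link := Para.Add(TSketchElement.Create('a'));
    Link.Attributes.Add('href', 'page.html');
    Link.Add(TSketchElement.Create('', 'this page'));

    Root.FindByTag('a', Links);
    Writeln(Links.Count);                  // 1
    Writeln(Links[0].Attributes['href']);  // page.html
    Writeln(Root.PlainText);               // See this page
  finally
    Links.Free;
    Root.Free;
  end;
end.
```

Keeping text in separate child nodes, as the sketch does, means plain-text rendering reduces to a depth-first concatenation, and attribute lookups are a single dictionary access per node.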

Features:
  • HTML 4 and 5, including documents with misplaced tags
  • Parsed into a tree with the tag attributes in the nodes
  • Text parts in separate elements
  • JavaScript-like functions for retrieval
  • Fast


Audience: Information Technology, Developers.

Programming Language: Delphi/Kylix.
Categories:
HTML/XHTML, WWW/HTTP, Software Development

©2024. Winfy. All Rights Reserved.

By OD Group OU – Registry code: 1609791 -VAT number: EE102345621.