List Info

Thread: html parser




html parser
user name
2006-06-25 14:18:38
Hi,
I'm trying to find a html parser and came across this library. When i'm looking at c++ bindings for libxml2, i found that, the html parser has no bindings in c++. Is it true?,  or am i wrong? please point me in correct direction.

Also, if there're no html parser bindings, could somebody please give me pointers on parsing html file. I do not see any html parser examples for libxml2. (I've visited only libxml2 web page)

Thanks in advance,
Surya
html parser
user name
2006-06-25 20:40:34
On 25.06.06 19:48:38, Surya Kiran Gullapalli wrote:
> Also, if there're no html parser bindings, could
somebody please give me
> pointers on parsing html file. I do not see any html
parser examples for
> libxml2. (I've visited only libxml2 web page)

The C-API provides various functions to parse a html file,
for example:

htmlDocPtr      htmlParseFile           (const char *
filename, 
                                         const char *
encoding)
htmlDocPtr      htmlReadDoc             (const xmlChar *
cur, 
                                         const char * URL, 
                                         const char *
encoding, 
                                         int options)
htmlDocPtr      htmlReadFd              (int fd, 
                                         const char * URL, 
                                         const char *
encoding, 
                                         int options)
htmlDocPtr      htmlReadFile            (const char *
filename, 
                                         const char *
encoding, 
                                         int options)
htmlDocPtr      htmlReadIO             
(xmlInputReadCallback ioread, 
                                        
xmlInputCloseCallback ioclose, 
                                         void * ioctx, 
                                         const char * URL, 
                                         const char *
encoding, 
                                         int options)
htmlDocPtr      htmlReadMemory          (const char *
buffer, 
                                         int size, 
                                         const char * URL, 
                                         const char *
encoding, 
                                         int options)

Klick on "API Menu" and then on HTMLParser for
more information.

Andreas

-- 
Excellent day to have a rotten day.
_______________________________________________
xml mailing list, project page  http://xmlsoft.org/
xmlgnome.org
http://mai
l.gnome.org/mailman/listinfo/xml
[1-2]

about | contact  Other archives ( Real Estate discussion Medical topics )