[ http://issues.apache.org/jira/browse/NUTCH-110?page=all ]
Stefan Neufeind updated NUTCH-110:
----------------------------------
Attachment: fixIllegalXmlChars08.patch
Since the original patch didn't apply cleanly for me on 0.8-dev
(nightly-2006-05-20), I redid it for 0.8.
With this patch the XML is fine. Without it I had big trouble
parsing the RSS feed in another application.
> OpenSearchServlet outputs illegal xml characters
> ------------------------------------------------
>
> Key: NUTCH-110
> URL: http://issues.apache.org/jira/browse/NUTCH-110
> Project: Nutch
> Type: Bug
> Components: searcher
> Versions: 0.7
> Environment: linux, jdk 1.5
> Reporter: stack archive.org
> Attachments: NUTCH-110-version2.patch, fixIllegalXmlChars.patch, fixIllegalXmlChars08.patch
>
> OpenSearchServlet does not check text-to-output for illegal XML
> characters; depending on the search result, it's possible for OSS to
> output XML that is not well-formed. For example, if the text has the
> FF (form feed) character in it, i.e. the ASCII character at position
> (decimal) 12, the produced XML will show the FF character as '&#12;'.
> The character/entity '&#12;' is not legal in XML according to
> http://www.w3.org/TR/2000/REC-xml-20001006#NT-Char.
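
For illustration only, here is a minimal sketch of the kind of filter the
description calls for. It is not the attached patch, and the class and method
names (XmlCharFilter, isLegalXmlChar, stripIllegalXmlChars) are hypothetical.
It assumes that silently dropping illegal characters is acceptable; it keeps
only code points allowed by the Char production of the XML 1.0 spec linked
above.

```java
// Hypothetical helper, not part of Nutch or of the attached patches.
final class XmlCharFilter {

    // True if the code point is allowed by the XML 1.0 Char production:
    // #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
    static boolean isLegalXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
            || (cp >= 0x20 && cp <= 0xD7FF)
            || (cp >= 0xE000 && cp <= 0xFFFD)
            || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    // Returns a copy of text with every illegal code point removed.
    // Unpaired surrogates fall outside the legal ranges and are dropped too.
    static String stripIllegalXmlChars(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (isLegalXmlChar(cp)) {
                out.appendCodePoint(cp);
            }
            i += Character.charCount(cp);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // \f is the form feed (decimal 12) from the example in the report.
        String dirty = "page one\fpage two";
        System.out.println(stripIllegalXmlChars(dirty)); // prints "page onepage two"
    }
}
```

A filter like this would run on each text value before it is written into
the response. Replacing illegal characters with a space instead of dropping
them is an equally plausible choice if word boundaries matter.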
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the
administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira