List Info

Thread: RSS feed that Venus cannot handle?




RSS feed that Venus cannot handle?
country flaguser name
United Kingdom
2007-02-28 10:23:35
Hi All,
I have setup a local planet and I noticed that a specific
RSS feed
causes problems with Venus or perhaps a python XML module.

The error message is

ERROR:planet.runner:  File
"/usr/lib/python2.4/site-packages/_xmlplus/dom/minidom.
py", line 210, in
normalize
    child.normalize()

and it appears a few hundred times.

The RSS feed in question is 
htt
p://tkarounos.blogspot.com/feeds/posts/default

Other blogspot RSS feeds appear to be ok.

Cheers,
Simos


-- 
devel mailing list
devellists.planetplanet.org

http://lists.planetplanet.org/mailman/listinfo/devel

Re: RSS feed that Venus cannot handle?
country flaguser name
United States
2007-03-03 07:29:16
Simos Xenitellis wrote:
> Hi All,
> I have setup a local planet and I noticed that a
specific RSS feed
> causes problems with Venus or perhaps a python XML
module.
> 
> The error message is
> 
> ERROR:planet.runner:  File
>
"/usr/lib/python2.4/site-packages/_xmlplus/dom/minidom.
py", line 210, in
> normalize
>     child.normalize()
> 
> and it appears a few hundred times.
> 
> The RSS feed in question is 
> htt
p://tkarounos.blogspot.com/feeds/posts/default
> 
> Other blogspot RSS feeds appear to be ok.

I took a look at it, and the problem is this entry:


http://tkarounos.blogspot.com/2006/11/re-public.html

In particular, look at line 329, somewhere around column
40000.  No, 
that is not a typo, column 40 thousand.

 From what I can tell, the code that sanitizes and
normalizes this feed 
comes to the conclusion that it *must* be in an infinite
loop, as nobody 
would ever create markup that nested.

I'll try to take a look at it again to see if I can scope
the problem 
somewhat - for example, loosing only the content for that
entry, or at 
most, losing only that entry instead of that whole feed.

- Sam Ruby
-- 
devel mailing list
devellists.planetplanet.org

http://lists.planetplanet.org/mailman/listinfo/devel

Re: RSS feed that Venus cannot handle?
country flaguser name
Malaysia
2007-03-04 07:40:14
Thanks Sam,
I'll forward this to the user to update the blog post.
It would be great if such post content could be tamed.

Cheers,
Simos

On Sat, 2007-03-03 at 08:29 -0500, Sam Ruby wrote:
> Simos Xenitellis wrote:
> > Hi All,
> > I have setup a local planet and I noticed that a
specific RSS feed
> > causes problems with Venus or perhaps a python XML
module.
> > 
> > The error message is
> > 
> > ERROR:planet.runner:  File
> >
"/usr/lib/python2.4/site-packages/_xmlplus/dom/minidom.
py", line 210, in
> > normalize
> >     child.normalize()
> > 
> > and it appears a few hundred times.
> > 
> > The RSS feed in question is 
> > htt
p://tkarounos.blogspot.com/feeds/posts/default
> > 
> > Other blogspot RSS feeds appear to be ok.
> 
> I took a look at it, and the problem is this entry:
> 
> 
http://tkarounos.blogspot.com/2006/11/re-public.html
> 
> In particular, look at line 329, somewhere around
column 40000.  No, 
> that is not a typo, column 40 thousand.
> 
>  From what I can tell, the code that sanitizes and
normalizes this feed 
> comes to the conclusion that it *must* be in an
infinite loop, as nobody 
> would ever create markup that nested.
> 
> I'll try to take a look at it again to see if I can
scope the problem 
> somewhat - for example, loosing only the content for
that entry, or at 
> most, losing only that entry instead of that whole
feed.
> 
> - Sam Ruby

-- 
devel mailing list
devellists.planetplanet.org

http://lists.planetplanet.org/mailman/listinfo/devel

Re: RSS feed that Venus cannot handle?
country flaguser name
United States
2007-03-05 09:08:25
Simos Xenitellis wrote:
> Thanks Sam,
> I'll forward this to the user to update the blog post.
> It would be great if such post content could be tamed.

Test case added:

http://intertwingly.net/code/venus/
tests/data/reconstitute/stack_overflow.xml

Explanation of the change: that one element in that one
entry won't get 
normalized.  The atom feed that will be produced will retain
the 
original markup.  Any other templates or filters may not
treat that one 
entry correctly (example: they may display the markup).

- Sam Ruby
-- 
devel mailing list
devellists.planetplanet.org

http://lists.planetplanet.org/mailman/listinfo/devel

[1-4]

about | contact  Other archives ( Real Estate discussion Medical topics )