|
List Info
Thread: OT What's next
|
|
| OT What's next |

|
2006-11-29 19:25:53 |
On Wed, 29 Nov 2006, Alan Gauld wrote:
> But for the Python programmer I'd consider Borland Delphi, based
> on Pascal. Pascal is much more readable and Pythonic than C
> and allows the same level of access to the underlying hardware
> (and assembler when needed).

I'm going to apologize for taking this even further off-topic, but, if
the OP is looking at Delphi, he should know that Borland provides a
version of Delphi, called Turbo Delphi Explorer, that you can download and
use for free. We also offer a priced "Professional" version, but for the
OP's interests, the Explorer edition will probably work just fine.

http://www.turboexplorer.com/
http://www.borland.com/downloads/download_turbo.html

Full disclosure: I say "we" above, because I *am* affiliated with Borland:
I'm Borland's in-house attorney for intellectual property and product
matters. (What's an attorney doing on a Python list? I was a developer
before I was an attorney, and still play around a little bit here and
there, and I usually use Python when I do.)
_______________________________________________
Tutor maillist - Tutor python.org
http://mail.python.org/mailman/listinfo/tutor
|
|
| Beautiful Soup |

|
2006-11-29 19:35:53 |
|
Hi,
I am using Beautiful Soup to extract links from a web page. Most pages use relative links, which is causing a problem. Is there a library that extracts complete links, or do I have to parse this myself?
Thanks, Shitiz
Terry Carroll <carroll tjc.com> wrote:
> On Wed, 29 Nov 2006, Alan Gauld wrote:
> > But for the Python programmer I'd consider Borland Delphi, based
> > on Pascal. Pascal is much more readable and Pythonic than C
> > and allows the same level of access to the underlying hardware
> > (and assembler when needed).
>
> I'm going to apologize for taking this even further off-topic, but, if
> the OP is looking at Delphi, he should know that Borland provides a
> version of Delphi, called Turbo Delphi Explorer, that you can download
> and use for free. We also offer a priced "Professional" version, but for
> the OP's interests, the Explorer edition will probably work just fine.
>
> http://www.turboexplorer.com/
> http://www.borland.com/downloads/download_turbo.html
>
> Full disclosure: I say "we" above, because I *am* affiliated with
> Borland: I'm Borland's in-house attorney for intellectual property and
> product matters. (What's an attorney doing on a Python list? I was a
> developer before I was an attorney, and still play around a little bit
> here and there, and I usually use Python when I do.)
|
| Beautiful Soup |

|
2006-11-29 19:53:39 |
On 11/30/06, Shitiz Bansal <shitizb yahoo.com> wrote:
> I am using beautiful soup for extracting links from a web page.
> Most pages use relative links in their pages which is causing a
> problem. Is there any library to extract complete links or do i
> have to parse this myself?
>
Beautiful Soup can also extract the text present on the page. If the
page contains no complete links, no library can extract them for you.
But since you fetched a particular web page to extract from, you already
have its URL. All you have to do then is prefix it to each extracted URL.
HTH
akash
|
|
| Beautiful Soup |

|
2006-11-29 20:07:15 |
* Akash <akashmahajan gmail.com> [061129 20:54]:
> On 11/30/06, Shitiz Bansal <shitizb yahoo.com> wrote:
> > I am using beautiful soup for extracting links from a web page.
> > Most pages use relative links in their pages which is causing a
> > problem. Is there any library to extract complete links or do i
> > have to parse this myself?
> >
>
> Beautiful Soup can also extract text which is present on the page. If
> there are no complete links no library can do that for you. But since
> you are reaching a certain web page to extract you already have that
> URL information with you. All you have to do then is to prefix it to
> each extracted URL.
Take a look at urlparse.urljoin from the standard library.
Andreas
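
For readers following along, here is a minimal sketch of that suggestion. It makes a few assumptions that are not in the thread: it targets Python 3, where urlparse.urljoin is available as urllib.parse.urljoin, and it collects links with the standard library's html.parser rather than Beautiful Soup so that it runs without extra packages. The names LinkCollector and absolute_links, the page address, and the markup are all hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin  # Python 2: from urlparse import urljoin


class LinkCollector(HTMLParser):
    """Collect the raw href value of every <a> tag (stand-in for Beautiful Soup)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def absolute_links(page_url, html):
    """Return every link found in html, resolved against page_url."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    return [urljoin(page_url, href) for href in collector.hrefs]


# Hypothetical page address and markup, for illustration only.
PAGE_URL = "http://example.com/docs/index.html"
HTML = (
    '<a href="intro.html">Intro</a> '
    '<a href="/about">About</a> '
    '<a href="http://python.org/">Python</a>'
)

for link in absolute_links(PAGE_URL, HTML):
    print(link)
```

The same urljoin call should work on href values pulled out with Beautiful Soup; only the extraction step differs. Links that are already absolute pass through unchanged.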
|
|
| Beautiful Soup |

|
2006-11-29 20:42:29 |
|
Thanks, urlparse.urljoin did the trick. Akash, the problem with directly prefixing the URL to the link is that the URL usually contains not just the page address but also parameters and fragments.
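
A small sketch of the difference, assuming Python 3 (urllib.parse.urljoin; in Python 2 the same function is urlparse.urljoin). The page URL and the relative links are made up for illustration.

```python
from urllib.parse import urljoin  # Python 2: from urlparse import urljoin

# Hypothetical page URL carrying both a query string and a fragment.
page_url = "http://example.com/dir/page.html?id=3#top"
href = "other.html"

# Plain prefixing glues the link onto the query string and fragment.
naive = page_url + href

# urljoin drops the base URL's query and fragment and replaces the
# last path segment with the relative link.
joined = urljoin(page_url, href)

# A link that climbs out of the current directory is resolved as well.
parent = urljoin(page_url, "../img/logo.png")

print(naive)
print(joined)
print(parent)
```

The first printed value is not a usable address, while the other two point at the intended resources.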
Andreas Kostyrka <andreas kostyrka.org> wrote:
> * Akash <akashmahajan gmail.com> [061129 20:54]:
> > On 11/30/06, Shitiz Bansal <shitizb yahoo.com> wrote:
> > > I am using beautiful soup for extracting links from a web page.
> > > Most pages use relative links in their pages which is causing a
> > > problem. Is there any library to extract complete links or do i
> > > have to parse this myself?
> >
> > Beautiful Soup can also extract text which is present on the page. If
> > there are no complete links no library can do that for you. But since
> > you are reaching a certain web page to extract you already have that
> > URL information with you. All you have to do then is to prefix it to
> > each extracted URL.
>
> Take a look at urlparse.urljoin from the standard library.
> Andreas
|
|
|
|
|