List Info

Thread: OT What's next




OT What's next
user name
2006-11-29 19:25:53
On Wed, 29 Nov 2006, Alan Gauld wrote:

> But for the Python programmer I'd consider Borland
Delphi, based
> on Pascal. Pascal is much more readable and Pythonic
than C
> and allows the same level of access to the underlying
hardware
> (and assembler when needed).

I'm going to apologize for taking this even further
off-topic, but, if 
the OP is looking at Delphi, he should know that Borland
provides a
version of Delphi, called Turbo Delphi Explorer, that you
can download and 
use for free.  We also offer a priced
"Professional" version, but for the 
OP's interests, the Explorer edition will probably work just
fine.

 http://www.turboexplore
r.com/
 
http://www.borland.com/downloads/download_turbo.html

Full disclosure: I say "we" above, because I *am*
affiliated with Borland: 
I'm Borland's in-house attorney for intellectual property
and product 
matters.  (What's an attorney doing on a Python list?  I was
a developer 
before I was an attorney, and still play around a little bit
here and 
there, and I usually use Python when I do.)

_______________________________________________
Tutor maillist  -  Tutorpython.org
http://
mail.python.org/mailman/listinfo/tutor
Beautiful Soup
user name
2006-11-29 19:35:53
Hi,

I am using beautiful soup for extracting links from a web page.
Most pages use relative links in their pages which is causing a problem. Is there any library to extract complete links or do i have to parse this myself?

Thanks,
Shitiz

Terry Carroll <carrolltjc.com>; wrote:
On Wed, 29 Nov 2006, Alan Gauld wrote:

&gt; But for the Python programmer I'd consider Borland Delphi, based
> on Pascal. Pascal is much more readable and Pythonic than C
> and allows the same level of access to the underlying hardware
&gt; (and assembler when needed).

I'm going to apologize for taking this even further off-topic, but, if
the OP is looking at Delphi, he should know that Borland provides a
version of Delphi, called Turbo Delphi Explorer, that you can download and
use for free. We also offer a priced "Professional" version, but for the
OP's interests, the Explorer edition will probably work just fine.

http://www.turboexplorer.com/
http://www.borland.com/downloads/download_turbo.html

Full disclosure: I say "we" above, because I *am* affiliated with Borland:
I'm Borland's in-house attorney for intellectual property and product
matters. (What's an attorney doing on a Python list? I was a developer
before I was an attorney, and still play around a little bit here and
there, and I usually use Python when I do.)

_______________________________________________
Tutor maillist - Tutorpython.org
http://mail.python.org/mailman/listinfo/tutor


Cheap Talk? Check out Yahoo! Messenger's low PC-to-Phone call rates.
Beautiful Soup
user name
2006-11-29 19:53:39
On 11/30/06, Shitiz Bansal <shitizbyahoo.com> wrote:
> I am using beautiful soup for extracting links from a
web page.
> Most pages use relative links in their pages which is
causing a problem. Is
> there any library to extract complete links or do i
have to parse this
> myself?
>

Beautiful Soup can also extract text which is present on the
page. If
there are no complete links no library can do that for you.
But since
you are reaching a certain web page to extract you already
have that
URL information with you. All you have to do then is to
prefix it to
each extracted URL.

HTH

akash
_______________________________________________
Tutor maillist  -  Tutorpython.org
http://
mail.python.org/mailman/listinfo/tutor
Beautiful Soup
user name
2006-11-29 20:07:15
* Akash <akashmahajangmail.com> [061129
20:54]:
> On 11/30/06, Shitiz Bansal <shitizbyahoo.com> wrote:
> > I am using beautiful soup for extracting links
from a web page.
> > Most pages use relative links in their pages which
is causing a problem. Is
> > there any library to extract complete links or do
i have to parse this
> > myself?
> >
> 
> Beautiful Soup can also extract text which is present
on the page. If
> there are no complete links no library can do that for
you. But since
> you are reaching a certain web page to extract you
already have that
> URL information with you. All you have to do then is to
prefix it to
> each extracted URL.
Take a look at urlparse.urljoin from the standard library.

Andreas
_______________________________________________
Tutor maillist  -  Tutorpython.org
http://
mail.python.org/mailman/listinfo/tutor
Beautiful Soup
user name
2006-11-29 20:42:29
Thanks, urlparse.urljoin did the trick.
Akash- the problem with directly prefixing url to the link is that the url most of the times contains not just the page address but also parameters and fragments.

Andreas Kostyrka <andreaskostyrka.org> wrote:
* Akash gmail.com> [061129 20:54]:
&gt; On 11/30/06, Shitiz Bansal yahoo.com> wrote:
>; > I am using beautiful soup for extracting links from a web page.
> > Most pages use relative links in their pages which is causing a problem. Is
> > there any library to extract complete links or do i have to parse this
> > myself?
&gt; >
>
> Beautiful Soup can also extract text which is present on the page. If
> there are no complete links no library can do that for you. But since
> you are reaching a certain web page to extract you already have that
> URL information with you. All you have to do then is to prefix it to
> each extracted URL.
Take a look at urlparse.urljoin from the standard library.

Andreas
yahoo.com>
gmail.com>


Access over 1 million songs - Yahoo! Music Unlimited.
[1-5]

about | contact  Other archives ( Real Estate discussion Medical topics )