This link has been bookmarked by 593 people. It was first bookmarked on 02 Mar 2006, by Matt Schneider.
-
21 Oct 12
-
17 May 12
-
10 May 12
-
02 May 12
-
24 Apr 12
-
23 Apr 12
-
20 Apr 12
-
16 Apr 12
-
05 Apr 12
-
31 Mar 12
-
24 Mar 12
-
23 Mar 12
-
22 Mar 12
-
13 Mar 12
-
10 Mar 12
-
07 Mar 12
-
29 Feb 12
-
28 Feb 12
-
22 Feb 12
-
17 Feb 12
richtbreak: "Beautiful Soup provides a few simple methods and Pythonic idioms for navigating, searching, and modifying a parse tree: a toolkit for dissecting a document and extracting what you need."
-
09 Feb 12
rojwilco: Beautiful Soup is a Python library designed for quick turnaround projects like screen-scraping. Three features make it powerful:
1. Beautiful Soup provides a few simple methods and Pythonic idioms for navigating, searching, and modifying a parse tree: a toolkit for dissecting a document and extracting what you need. It doesn't take much code to write an application
2. Beautiful Soup automatically converts incoming documents to Unicode and outgoing documents to UTF-8. You don't have to think about encodings, unless the document doesn't specify an encoding and Beautiful Soup can't autodetect one. Then you just have to specify the original encoding.
3. Beautiful Soup sits on top of popular Python parsers like lxml and html5lib, allowing you to try out different parsing strategies or trade speed for flexibility.
Beautiful Soup parses anything you give it, and does the tree traversal stuff for you. You can tell it "Find all the links", or "Find all the links of class externalLink", or "Find all the links whose URLs match 'foo.com'", or "Find the table heading that's got bold text, then give me that text."
Valuable data that was once locked up in poorly-designed websites is now within your reach. Projects that would have taken hours take only minutes with Beautiful Soup.
-
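The four queries quoted in the annotation above can be sketched roughly as follows. This is a minimal illustration, not taken from the bookmarked page: it assumes the `bs4` package (Beautiful Soup 4) is installed and uses Python's built-in `html.parser`; the sample HTML and variable names are made up for the example.

```python
import re

from bs4 import BeautifulSoup

# Hypothetical sample page, invented for this sketch.
HTML = """
<html><body>
<table><tr><th><b>Price</b></th><th>Notes</th></tr></table>
<a href="/about">About</a>
<a class="externalLink" href="http://foo.com/page">Foo</a>
<a class="externalLink" href="http://example.org/">Example</a>
</body></html>
"""

soup = BeautifulSoup(HTML, "html.parser")

# "Find all the links"
all_links = soup.find_all("a")

# "Find all the links of class externalLink"
external_links = soup.find_all("a", class_="externalLink")

# "Find all the links whose URLs match 'foo.com'"
foo_links = soup.find_all("a", href=re.compile(r"foo\.com"))

# "Find the table heading that's got bold text, then give me that text"
bold_heading = next((th for th in soup.find_all("th") if th.find("b")), None)
heading_text = bold_heading.get_text(strip=True) if bold_heading else None
```

Each `find_all` call returns a list of matching tags, so the results can be looped over or indexed like any other Python list.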
08 Feb 12
-
23 Jan 12
-
04 Jan 12
-
28 Dec 11
-
21 Nov 11
-
23 Oct 11
-
06 Oct 11
-
27 Sep 11
-
26 Sep 11
-
04 Sep 11
-
28 Aug 11
-
23 Aug 11
-
11 Aug 11
-
10 Aug 11
-
05 Aug 11
-
02 Aug 11
-
18 Jul 11
-
09 Jul 11
-
20 Jun 11
-
07 Jun 11
-
24 May 11
-
21 May 11
John Banbury: "Beautiful Soup is a Python HTML/XML parser designed for quick turnaround projects like screen-scraping"
-
14 Apr 11
-
12 Apr 11
-
28 Mar 11
-
08 Mar 11
-
07 Mar 11
-
05 Mar 11
-
03 Mar 11
-
17 Feb 11
-
16 Feb 11
-
14 Feb 11
-
07 Feb 11
-
29 Jan 11
-
24 Jan 11
Alexander Tsang: "You didn't write that awful page. You're just trying to get some data out of it. Right now, you don't really care what HTML is supposed to look like. Neither does this parser."
-
13 Jan 11
-
03 Jan 11
-
29 Dec 10
mANIA pHOBIC: Beautiful Soup is a Python HTML/XML parser designed for quick turnaround projects like screen-scraping. Beautiful Soup parses anything you give it, and does the tree traversal stuff for you.
-
16 Dec 10
elnajay: You didn't write that awful page. You're just trying to get some data out of it. Right now, you don't really care what HTML is supposed to look like.
Neither does this parser.
Tags: html xml python parser programming web tools library development
-
11 Dec 10
-
24 Nov 10
-
22 Nov 10
-
21 Nov 10
-
18 Nov 10
Alex Yakovlev: "Beautiful Soup is a Python HTML/XML parser designed for quick turnaround projects like screen-scraping. Three features make it powerful:
1. Beautiful Soup won't choke if you give it bad markup. It yields a parse tree that makes approximately as much sense as your original document. This is usually good enough to collect the data you need and run away.
2. Beautiful Soup provides a few simple methods and Pythonic idioms for navigating, searching, and modifying a parse tree: a toolkit for dissecting a document and extracting what you need. You don't have to create a custom parser for each application.
3. Beautiful Soup automatically converts incoming documents to Unicode and outgoing documents to UTF-8. You don't have to think about encodings, unless the document doesn't specify an encoding and Beautiful Soup can't autodetect one. Then you just have to specify the original encoding.
Beautiful Soup parses anything you give it, and does the tree traversal stuff for you. You can tell it "Find all the links", or "Find all the links of class externalLink", or "Find all the links whose URLs match 'foo.com'", or "Find the table heading that's got bold text, then give me that text."
Valuable data that was once locked up in poorly-designed websites is now within your reach. Projects that would have taken hours take only minutes with Beautiful Soup."
Tags: opensource python development programming library html parser xml tools web scraping
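Features 1 and 3 in the annotation above (tolerating bad markup, and converting input to Unicode and output to UTF-8) might look something like this in practice. It is only a sketch under assumptions: it uses the `bs4` package with Python's built-in `html.parser`, and the Latin-1 byte string with unclosed tags is invented for the example. The exact tree shape built from broken markup depends on which parser is used, so the sketch inspects only the tags and text that were recovered.

```python
from bs4 import BeautifulSoup

# Hypothetical input: Latin-1 encoded bytes with unclosed <p> and <b> tags.
raw = "<p>Caf\u00e9 <b>menu<p>Second paragraph".encode("latin-1")

# The document declares no encoding, so the original encoding is given explicitly.
soup = BeautifulSoup(raw, "html.parser", from_encoding="latin-1")

# The broken markup still yields a usable tree with both paragraphs present.
paragraphs = soup.find_all("p")

# Text comes back as Unicode (a Python str).
text = soup.get_text()

# Serialising the tree produces UTF-8 bytes.
utf8_bytes = soup.encode("utf-8")
```

If the encoding is left unspecified, Beautiful Soup tries to detect it, and `from_encoding` is the fallback for cases where that guess is wrong.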
-
13 Nov 10
-
27 Oct 10
-
20 Oct 10
-
13 Oct 10
-
09 Oct 10
-
30 Sep 10
-
24 Sep 10
-
15 Sep 10
-
26 Aug 10
Gjm GLing: Beautiful Soup is a Python HTML/XML parser designed for quick turnaround projects like screen-scraping. Three features make it powerful:
1. Beautiful Soup won't choke if you give it bad markup. It yields a parse tree that makes approximately as much s
-
18 Aug 10
-
16 Jul 10
Greg Saven: Beautiful Soup is a Python HTML/XML parser designed for quick turnaround projects like screen-scraping.
-
04 Jul 10
-
22 Jun 10
Pedro Ângelo: A Python HTML/XML parser designed for quick turnaround projects like screen-scraping
-
03 Jun 10
Lisa Spiro: "Beautiful Soup is a Python HTML/XML parser designed for quick turnaround projects like screen-scraping. Three features make it powerful:"
-
30 May 10
-
15 May 10
-
11 May 10
-
10 May 10
-
06 May 10
-
04 May 10
-
03 May 10
-
02 May 10
-
21 Apr 10
-
01 Apr 10
Martin Dante: You didn't write that awful page. You're just trying to get some data out of it. Right now, you don't really care what HTML is supposed to look like.
Neither does this parser.
-
31 Mar 10
-
22 Mar 10
-
12 Mar 10
-
03 Mar 10
-
26 Feb 10