This link has been bookmarked by 66 people. It was first bookmarked on 05 May 2008, by Arrix Z.
-
11 Jan 12
-
01 Oct 11
-
14 Aug 11
-
04 Aug 11
Hendy Irawan: "I did some digging to see what people had previously built, but the landscape was pretty bleak. The only one that I could find was one made by Erik Arvidsson - a simple SAX-style HTML parser. Considering that this contained only the most basic parsing - and none of the actual, complicated, HTML logic - there was still a lot of work left to be done.
(I also contemplated porting the HTML 5 parser, wholesale, but that seemed like a herculean effort.)
However, the result is one that I'm quite pleased with. It won't match the compliance of html5lib, nor the speed of a pure XML parser, but it's able to get the job done with little fuss - while still being highly portable."
Tags: javascript html parser programming webdev library xml development
-
07 Apr 11
-
04 Apr 11
-
23 Feb 11
-
26 Jan 11
-
28 Oct 10
-
05 Oct 10
-
04 Oct 10
-
16 Apr 10
-
07 Apr 10
-
21 Jan 10
-
24 Aug 09
-
28 May 09
-
23 May 09
-
21 May 09
-
26 Apr 09
-
12 Jan 09
-
28 Nov 08
-
02 Jul 08
-
26 May 08
-
22 May 08
-
19 May 08
-
13 May 08
-
12 May 08
-
08 May 08
-
07 May 08
-
I've been toying with the ability to port env.js to other platforms (Spidermonkey derivatives and the ECMAScript 4 Reference Implementation) and if I were to do so I would need an HTML parser. Because of this fact it became easiest to just write an HTML parser in pure JavaScript.
I did some digging to see what people had previously built, but the landscape was pretty bleak. The only one that I could find was one made by Erik Arvidsson - a simple SAX-style HTML parser. Considering that this contained only the most basic parsing - and none of the actual, complicated, HTML logic - there was still a lot of work left to be done.
(I also contemplated porting the HTML 5 parser, wholesale, but that seemed like a herculean effort.)
However, the result is one that I'm quite pleased with. It won't match the compliance of html5lib, nor the speed of a pure XML parser, but it's able to get the job done with little fuss - while still being highly portable.
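For readers unfamiliar with the term, "SAX-style" means the parser does not build a tree; it walks the markup and fires a callback for each start tag, end tag, text run, and comment. The sketch below is only an illustration of that idea: `saxParse` and the handler names `start`, `end`, `chars`, and `comment` are hypothetical and are not taken from Erik Arvidsson's parser or from the library described above. It also deliberately leaves out the "complicated HTML logic" the quote mentions, such as implicitly closed elements.

```javascript
// Hypothetical SAX-style tokenizer; names and handler shape are illustrative only.
function saxParse(html, handler) {
  // One alternative per token kind: comment, end tag, start tag, text run.
  // Input that matches none of them (e.g. a stray "<") is silently skipped.
  const token = /<!--([\s\S]*?)-->|<\/([A-Za-z][\w-]*)\s*>|<([A-Za-z][\w-]*)((?:\s+[^\s=>\/]+(?:\s*=\s*(?:"[^"]*"|'[^']*'|[^\s>]+))?)*)\s*(\/?)>|([^<]+)/g;
  // Attribute name, then an optional double-quoted, single-quoted, or bare value.
  const attr = /([^\s=\/]+)(?:\s*=\s*(?:"([^"]*)"|'([^']*)'|([^\s>]+)))?/g;
  let m;
  while ((m = token.exec(html)) !== null) {
    if (m[1] !== undefined) {
      if (handler.comment) handler.comment(m[1]);
    } else if (m[2] !== undefined) {
      if (handler.end) handler.end(m[2].toLowerCase());
    } else if (m[3] !== undefined) {
      const attrs = {};
      let a;
      attr.lastIndex = 0; // reset the shared regex before scanning this tag
      while ((a = attr.exec(m[4])) !== null) {
        // Valueless attributes (e.g. "disabled") are stored as an empty string.
        attrs[a[1].toLowerCase()] = a[2] ?? a[3] ?? a[4] ?? "";
      }
      if (handler.start) handler.start(m[3].toLowerCase(), attrs, m[5] === "/");
    } else if (m[6] !== undefined) {
      if (handler.chars) handler.chars(m[6]);
    }
  }
}

// Usage: record the event stream for a small fragment.
const events = [];
saxParse('<p class="note">Hi <br/>there</p><!-- done -->', {
  start: (tag, attrs, selfClosing) => events.push(["start", tag, attrs, selfClosing]),
  end: (tag) => events.push(["end", tag]),
  chars: (text) => events.push(["chars", text]),
  comment: (text) => events.push(["comment", text]),
});
console.log(events);
```

A tokenizer like this reports tags exactly as written, so it cannot by itself repair unbalanced markup; that repair layer is presumably the remaining work the quoted passage refers to.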
-
-
06 May 08
-
05 May 08
-
João Campos: JavaScript library to parse HTML documents (converts to DOM and XML). Useful as browsers don't always support HTML parsing in JavaScript.
-
Jason Wehmhoener: John Resig has written an HTML parser in JavaScript for use with Rhino.