41 Commits (1da055ee0dd11b8c16644c27381a2b20bf0ff205)

Author SHA1 Message Date
R David Murray 44b548dda8 #27364: fix "incorrect" uses of escape character in the stdlib. 10 years ago
Martin Panter 46f50726a0 Issue #27076: Doc, comment and tests spelling fixes 10 years ago
Ezio Melotti 6f2bb98966 #23144: Make sure that HTMLParser.feed() returns all the data, even when convert_charrefs is True. 11 years ago
Ezio Melotti 6fc16d81af #21047: set the default value for the *convert_charrefs* argument of HTMLParser to True. Patch by Berker Peksag. 12 years ago
Ezio Melotti 73a4359eb0 #15114: the strict mode and argument of HTMLParser, HTMLParser.error, and the HTMLParserError exception have been removed. 12 years ago
Ezio Melotti f27b9a741a #20288: fix handling of invalid numeric charrefs in HTMLParser. 12 years ago
Ezio Melotti 95401c5f6b #13633: Added a new convert_charrefs keyword arg to HTMLParser that, when True, automatically converts all character references. 12 years ago
Ezio Melotti f6de9eb2bb #19688: add back and deprecate the internal HTMLParser.unescape() method. 12 years ago
Ezio Melotti 4a9ee26750 #2927: Added the unescape() function to the html module. 12 years ago
Ezio Melotti 7165d8b9ba #19480: HTMLParser now accepts all valid start-tag names as defined by the HTML5 standard. 12 years ago
Ezio Melotti 88ebfb129b #15114: The html.parser module now raises a DeprecationWarning when the strict argument of HTMLParser or the HTMLParser.error method are used. 12 years ago
Ezio Melotti 8e596a765c #17802: Fix an UnboundLocalError in html.parser. Initial tests by Thomas Barlow. 13 years ago
Ezio Melotti 1698babd1b #14679: add an __all__ (that contains only HTMLParser) to html.parser. 13 years ago
Ezio Melotti 46495182d0 #15156: HTMLParser now uses the new "html.entities.html5" dictionary. 14 years ago
Ezio Melotti 3861d8b271 #15114: the strict mode of HTMLParser and the HTMLParseError exception are deprecated now that the parser is able to parse invalid markup. 14 years ago
Ezio Melotti 0780b6bc58 #14538: HTMLParser can now parse correctly start tags that contain a bare /. 14 years ago
Ezio Melotti 29877e8e04 HTMLParser is now able to handle slashes in the start tag. 14 years ago
Ezio Melotti e31ddedb0e Fix an index and clean up comments. 14 years ago
Ezio Melotti f4ab491901 Improve handling of declarations in HTMLParser. 14 years ago
Ezio Melotti 5211ffe4df #13993: HTMLParser is now able to handle broken end tags when strict=False. 14 years ago
Ezio Melotti fa3702dc28 #13960: HTMLParser is now able to handle broken comments when strict=False. 14 years ago
Ezio Melotti 15cb489234 #13358: HTMLParser now calls handle_data only once for each CDATA. 14 years ago
Ezio Melotti c2fe57762b #1745761, #755670, #13357, #12629, #1200313: improve attribute handling in HTMLParser. 14 years ago
Ezio Melotti 7de56f6a04 #670664: Fix HTMLParser to correctly handle the content of ``<script>...</script>`` and ``<style>...</style>``. 14 years ago
Ezio Melotti f50ffa94ab #13273: fix a bug that prevented HTMLParser to properly detect some tags when strict=False. 14 years ago
Ezio Melotti d9e0b068af #12888: Fix a bug in HTMLParser.unescape that prevented it to escape more than 128 entities. Patch by Peter Otten. 15 years ago
Éric Araujo 39f180bb1f Fix display of html.parser.HTMLParser.feed docstring 15 years ago
Ezio Melotti 2e3607c1e7 #7311: fix html.parser to accept non-ASCII attribute values. 15 years ago
Senthil Kumaran 6c85838489 Merged revisions 87542 via svnmerge from 15 years ago
Senthil Kumaran 164540fee1 Fix Issue10759 - html.parser.unescape() fails on HTML entities with incorrect syntax 15 years ago
R. David Murray b579dba119 #1486713: Add a tolerant mode to HTMLParser. 15 years ago
Victor Stinner 30c223cff5 Merged revisions 81504 via svnmerge from 16 years ago
Victor Stinner e021f4b206 Recorded merge of revisions 81500-81501 via svnmerge from 16 years ago
Antoine Pitrou fd036451bf #2834: Change re module semantics, so that str and bytes mixing is forbidden, 18 years ago
Mark Dickinson f64dcf3ce0 Change test_htmlparser to reflect the HTMLParser -> html.parser 18 years ago
Georg Brandl bcdafa44f2 Remove html package and fix test_htmlparser. 18 years ago
Fred Drake d995e1150c revert creation of the html.entities and html.parser modules 18 years ago
Fred Drake 3c50ea4303 rename HTMLParser to html.parser and htmlentitydefs to html.entities; 18 years ago
Fred Drake 91ae250273 rename HTMLParser to html.parser, htmlentitydefs to html.entities 18 years ago
Fred Drake cb5c80f6d9 rename markupbase to _markupbase 18 years ago
Guido van Rossum 84fc66dd02 Rename 'unicode' to 'str' in its tp_name field. Rename 'str' to 'str8'. 19 years ago
Guido van Rossum ef87d6ed94 Rip out all the u"..." literals and calls to unicode(). 19 years ago
Guido van Rossum d8faa3654c Merged revisions 53952-54987 via svnmerge from 19 years ago
Martin v. Löwis ab8a6bba25 Patch #912410: Replace HTML entity references for attribute values 19 years ago
Georg Brandl cd3c26a717 Reverting previous checkin. This breaks too much of HTMLParser to be applied 21 years ago
Georg Brandl 7847405a76 bug [ 761452 ] HTMLParser chokes on my.yahoo.com output 21 years ago
Fred Drake 49b4d19172 remove unnecessary override of base class method 22 years ago
Andrew M. Kuchling b7d8ce0275 [Bug #921657] Allow '@' in unquoted HTML attributes. Not strictly legal according to the HTML REC, but HTMLParser is already a pretty loose parser. Reported by Bernd Zimmermann. 22 years ago
Walter Dörwald 70a6b49821 Replace backticks with repr() or "%r" 22 years ago
Fred Drake 0834d77bc4 Accept commas in unquoted attribute values. 23 years ago