cpython

Commit Graph

Author	SHA1	Message	Date
R David Murray	749073af13	#1874: detect invalid multipart CTE and report it as a defect.	15 years ago
R David Murray	3edd22ac95	#11731: simplify/enhance parser/generator API by introducing policy objects. This new interface will also allow for future planned enhancements in control over the parser/generator without requiring any additional complexity in the parser/generator API. Patch reviewed by Éric Araujo and Barry Warsaw.	15 years ago
R David Murray	8437fe2708	Remove unused method from internal class.	15 years ago
R David Murray	c5c1472895	#11605: don't use set/get_payload in feedparser; they do conversions. Really the whole API needs to be gone over to restore the separation of concerns; but that's what email6 is about.	15 years ago
R. David Murray	96fd54eaec	#4661: add bytes parsing and generation to email (email version bump to 5.1.0) The work on this is not 100% complete, but everything is present to allow real-world testing of the code. The only remaining major todo item is to (hopefully!) enhance the handling of non-ASCII bytes in headers converted to unicode by RFC2047 encoding them rather than replacing them with '?'s.	16 years ago
R. David Murray	6d4a06c91e	Merged revisions 82922 via svnmerge from svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r82922 | r.david.murray | 2010-07-16 21:19:57 -0400 (Fri, 16 Jul 2010) | 4 lines #1555570: correctly handle a \r\n that is split by the read buffer. Patch and test by Tony Nelson. ........	16 years ago
R. David Murray	45bf773f60	#1555570: correctly handle a \r\n that is split by the read buffer. Patch and test by Tony Nelson.	16 years ago
R. David Murray	71df9d9216	Merged revisions 82011 via svnmerge from svn+ssh://pythondev@svn.python.org/python/branches/py3k ................ r82011 | r.david.murray | 2010-06-15 22:19:40 -0400 (Tue, 15 Jun 2010) | 17 lines Merged revisions 81675 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81675 | r.david.murray | 2010-06-03 11:43:20 -0400 (Thu, 03 Jun 2010) | 10 lines #5610: use \Z not $ so we don't eat extra chars when body part ends with \r\n. If a body part ended with \r\n, feedparser, using '$' to terminate its search for the newline, would match on the \r\n, and think that it needed to strip two characters in order to account for the line end before the boundary. That made it chop one too many characters off the end of the body part. Using \Z makes the match correct. Patch and test by Tony Nelson. ........ ................	16 years ago
R. David Murray	45e0e1444b	Merged revisions 81675 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81675 | r.david.murray | 2010-06-03 11:43:20 -0400 (Thu, 03 Jun 2010) | 10 lines #5610: use \Z not $ so we don't eat extra chars when body part ends with \r\n. If a body part ended with \r\n, feedparser, using '$' to terminate its search for the newline, would match on the \r\n, and think that it needed to strip two characters in order to account for the line end before the boundary. That made it chop one too many characters off the end of the body part. Using \Z makes the match correct. Patch and test by Tony Nelson. ........	16 years ago
Guido van Rossum	3172c5d263	Patch# 1258 by Christian Heimes: kill basestring. I like this because it makes the code shorter! :-)	19 years ago
Guido van Rossum	8b3febef2f	Copying the email package back, despite its failings.	19 years ago
Guido van Rossum	6398b7a351	Remove the email package for now. Once Barry and the email-sig have a working new version we'll add it back. If it doesn't make the 3.0a deadline (release August 31), too bad.	19 years ago
Georg Brandl	a18af4e7a2	PEP 3114: rename .next() to .__next__() and add next() builtin.	19 years ago
Thomas Wouters	49fd7fa443	Merge p3yk branch with the trunk up to revision 45595. This breaks a fair number of tests, all because of the codecs/_multibytecodecs issue described here (it's not a Py3K issue, just something Py3K discovers): http://mail.python.org/pipermail/python-dev/2006-April/064051.html Hye-Shik Chang promised to look for a fix, so no need to fix it here. The tests that are expected to break are: test_codecencodings_cn test_codecencodings_hk test_codecencodings_jp test_codecencodings_kr test_codecencodings_tw test_codecs test_multibytecodec This merge fixes an actual test failure (test_weakref) in this branch, though, so I believe merging is the right thing to do anyway.	20 years ago
Barry Warsaw	6153201274	SF bug #1347874; FeedParser does not comply with RFC2822. Change headerRE as suggested in the bug report, so that single character headers are accepted. Test case added too. Will backport to Python 2.4.	20 years ago
Barry Warsaw	7cf9ce2440	Fixes for SF #1076485, which I'll apply to the CVS head too. The problem was caused by a self._input.readline() call that wasn't checking for the NeedsMoreData marker. msg_43.txt contains a message that illustrates the problem, when email.message_from_*() is called. That interface uses the Parser API, which splits reads into 8192 byte chunks. It so happens that for the test message, the 8192 chunk falls inside a message/delivery-status, which is where in the FeedParser the readline() call was that didn't check for NeedsMoreData. I also added an assert to unreadline() so it'll be more evident if an attempt to push back NeedsMoreData ever happens again. Bump the email package version number.	22 years ago
Barry Warsaw	f4c7c402d4	RFC 2822 describes the characters allowed in a header field name. Conform to this, and add test cases.	22 years ago
Barry Warsaw	2e8c1f189a	Fix for SF bug #1072623. When the last line of the input string does not end in a newline, and it's an end boundary, the FeedParser wasn't recognizing it as such. Tweak the regexp to make the ending linesep optional. For grins, clear self._partial when closing the BufferedSubFile. Added a test case.	22 years ago
Barry Warsaw	dee0cf12e3	Fix SF bug # 1030941. In _parsegen(), in the clause where we're capturing_preamble but we found a StartBoundaryNotFoundDefect, we need to consume all lines from the current position to the EOF, which we'll set as the epilogue of the current message. If we're not at EOF when we return from here, the outer message's capturing_preamble assertion will fail.	22 years ago
Barry Warsaw	bb11386730	Big email 3.0 API changes, with updated unit tests and documentation. Briefly (from the NEWS file): - Updates for the email package: + All deprecated APIs that in email 2.x issued warnings have been removed: _encoder argument to the MIMEText constructor, Message.add_payload(), Utils.dump_address_pair(), Utils.decode(), Utils.encode() + New deprecations: Generator.__call__(), Message.get_type(), Message.get_main_type(), Message.get_subtype(), the 'strict' argument to the Parser constructor. These will be removed in email 3.1. + Support for Python earlier than 2.3 has been removed (see PEP 291). + All defect classes have been renamed to end in 'Defect'. + Some FeedParser fixes; also a MultipartInvariantViolationDefect will be added to messages that claim to be multipart but really aren't. + Updates to documentation.	22 years ago
Barry Warsaw	8896bf56a2	Resolution of SF bug #1002475 and patch #1003693; Header lines that end in \r\n only get the \n stripped, not the \r (unless it's the last header which does get the \r stripped). Patch by Tony Meyer. test_whitespace_continuation_last_header(), test_strip_line_feed_and_carriage_return_in_headers(): New tests. _parse_headers(): Be sure to strip \r\n from the right side of header lines.	22 years ago
Barry Warsaw	e4aeb7d1f1	_parsegen(): Add a missing check for NeedMoreData.	22 years ago
Barry Warsaw	4e59bc1e67	readline(): RFC 2046, section 5.1.2 (and partially 5.1) both state that the parser must recognize outer boundaries in inner parts. So cruise through the EOF stack backwards testing each predicate against the current line. There's still some discussion about whether this is (always) the best thing to do. Anthony would rather parse these messages as if the outer boundaries were ignored. I think that's counter to the RFC, but might be practically more useful. Can you say behavior flag? (ug).	22 years ago
Barry Warsaw	486cb0ac2a	Tests for message/external-body and for duplicate boundary lines.	22 years ago
Barry Warsaw	d38f448865	_parsegen(): Move the message/rfc822 clause to after the message/delivery-status clause, and genericize it to handle all (other) message/* content types. This lets us correctly parse 2 more of Anthony's MIME torture tests (specifically, the message/external-body examples).	22 years ago
Barry Warsaw	5b44cd64d7	_parsegen(): Watch out for empty epilogues.	22 years ago
Barry Warsaw	c29db26529	_parse_headers(): Strip a trailing newline from the envelope header. Closes SF #951088.	22 years ago
Barry Warsaw	418101fd64	An updated FeedParser that should be RFC compliant, passes all existing (standard) tests, and doesn't throw parse errors. I still need to throw Anthony's torture test at it, but I wanted to get this checked in and off my disk.	22 years ago
Anthony Baxter	39a0f04421	New parser. Next up, making the current parser use this parser	22 years ago

12 Commits (261ccdce4825535d4f6ea4bf09e9394bb751df20)
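
The #5610 message above explains why '$' strips one character too many while \Z does not. The sketch below reproduces that behavior with Python's re module. The two pattern objects and the strip_line_end() helper are hypothetical stand-ins written for illustration, not code copied from feedparser; the input is an assumed last line of a body part that ends in \r\n and is followed by the \n that precedes the boundary.

```python
import re

# Hypothetical stand-ins for the end-of-line pattern discussed in #5610.
# '$' also matches just before a newline that ends the string; \Z matches
# only at the very end of the string.
EOL_DOLLAR = re.compile(r'(\r\n|\r|\n)$')
EOL_END = re.compile(r'(\r\n|\r|\n)\Z')


def strip_line_end(line, pattern):
    """Drop as many trailing characters as the matched line ending is long."""
    match = pattern.search(line)
    if match is None:
        return line
    return line[:-len(match.group(0))]


# Assumed input: the body part ends with \r\n, then comes the \n that
# belongs to the boundary line and should be the only thing stripped.
last_line = 'body\r\n\n'

with_dollar = strip_line_end(last_line, EOL_DOLLAR)  # matches '\r\n', strips two chars
with_end = strip_line_end(last_line, EOL_END)        # matches '\n', strips one char

print(repr(with_dollar))  # 'body\r'
print(repr(with_end))     # 'body\r\n'
```

With '$', the search succeeds on the earlier \r\n because '$' is satisfied just before the final \n, so two characters are removed and the body loses its own \n. With \Z, only the final \n can match, so the body part keeps its \r\n.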