PythonÖеıàÂë
	
    
    
	
pythonÖеıàÂë
ÔÎÄ£ºhttp://users.ir-lab.org/~liulong/blog/archives/001962.html
¼ÆËã»úÒÔ0,1¶þ½øÖÆÎ»À´´æ´¢ÐÅÏ¢,ËùÒÔ×Ö·ûÔÚ¼ÆËã»úÖеıíʾҲÊǶþ½øÖÆÎ»,ÄÇÿ¸ö×Ö·ûËù¶ÔÓ¦µÄ¶þ½øÖÆÎ»ÊÇʲô,ÔÚ¿ªÊ¼µÄ¼ÆËã»úÖÐÒòΪֻ¿¼ÂÇÓ¢Óï, ËùÒÔ²»³¬¹ý256¸ö×Ö·û,¼´ÓÃÒ»¸ö×Ö½Ú(8bit)¾Í×ã¿ÉÒÔ±íʾËùÓеÄ×Ö·û,Õâ¸ö¶ÔÓ¦¹ØÏµ¾ÍÊǶÔ×Ö·ûµÄ±àÂë,ÓÃÒ»¸öΨһµÄ×Ö½ÚÂëÀ´±íʾΨһµÄ×Ö·û.µ«ÊÇËæ ×ÅÒª¶Ôºº×Ö,ÈÕÓïµÈÆäËûÓïÑԵĴ¦Àí,ÕâÑùµÄ±àÂëÂú×ã²»ÁËÐèÇó,¸÷¸öµØÇø,¹ú¼Ò¾ÍÕë¶Ô¸÷×ÔµÄÓïÑÔ½¨Á¢ÁË×Ô¼ºµÄÒ»Ì×±àÂë,±ÈÈçgb2312,gbk,µÈ µÈ....ÕâÑùÊǽâ¾öÁËÒ»²¿·ÖÎÊÌâ,µ«ÊÇÕâ¾Í¸ø²»Í¬µÄϵͳ,ƽ̨֮¼äµÄ½»»¥Ôì³ÉÁ˺ܴóµÄÕϰ,ΪÁ˽â¾ö´ËÎÊÌâ,³öÏÖÁËunicode,ËüΪÿÖÖÓïÑÔÖеÄÿ ¸ö×Ö·ûÉ趨ÁËͳһ²¢ÇÒΨһµÄ¶þ½øÖƱàÂ룬ÒÔÂú×ã¿çÓïÑÔ¡¢¿çƽ̨½øÐÐÎı¾»ù׼ת»»¡¢´¦ÀíµÄÒªÇó¡£Ã¿¸ö×Ö·û¶¼ÓÃÈô¸É¸ö×Ö½ÚÀ´±íʾ,ÕâÑù¾Í½â¾öÁËÆ½Ì¨,ϵͳ֮¼ä µÄ½»»¥ÎÊÌâ,µ«ÊÇunicodeÓиöȱµã,ÒòΪËüÿ¸ö×Ö·û¶¼ÓÃÈô¸É×Ö½ÚÀ´±íʾ,¼´Ê¹Êǵ¥×Ö½ÚµÄ×Ö·û,Õâ¾ÍÔì³ÉÁËʱ¼äºÍ¿Õ¼äÉϵÄÀË·Ñ,Òò´Ë³öÏÖÁËutf,ÊÇ Öмäת»»±àÂë,ÓÐutf8,utf16,utf7µÈ.³öÏÖÁ˶àÖÖ±àÂëÖ®ºó,ÔÚ²»Í¬µÄϵͳ,ƽ̨֮¼ä»ò³ÌÐò½Ó¿ÚÖÐ񻃾¼°µ½±àÂëµÄת»»,³£¼ûµÄת»»ÓÐ:
1.unicode->ÆäËü±àÂë
ÀýÈ磺aΪunicode±àÂë ҪתΪgb2312:a.encode('gb2312')
2.ÆäËü±àÂë->unicode
ÀýÈ磺aΪgb2312±àÂ룬ҪתΪunicode: unicode(a, 'gb2312')»òa.decode('gb2312')
3,±àÂë1 -> ±àÂë2
ÏÈתΪunicodeÔÙתΪ±àÂë2
Èçgb2312תutf8
unicode(a, 'gb2312').encode(utf-8)
ĿǰΪֹÎÒ²»ÖªµÀÈçºÎÅжÏÒ»¸ö×Ö·ûµÄ±àÂëÊǺÎÖÖ±àÂë,Ö»ÊÇ´Ó³ÌÐòÖÐÅ×Òì³£,µÈÆäËû·½Ê½À´ÅжÏ,µ«ÊÇÓпÉÒÔÅжÏÊÇ·ñÊÇunicodeµÄ·½·¨:
isinstance(s, str) ÓÃÀ´ÅжÏÊÇ·ñΪһ°ã×Ö·û´®
isinstance(s, unicode) ÓÃÀ´ÅжÏÊÇ·ñΪunicode
Èç¹ûÒª°ÑÒ»¶¨±àÂëµÄ×Ö·ûÐòÁÐдµ½ÎļþÀï,Ö»Òª°Ñ×Ö·ûÐòÁбàÂëΪËùÐè±àÂë¼´¿É,ÀýÈç:
l = 'ÁõÁú'
l = unicode(l, 'cp936')
l = l.encode('utf-8')
open('test.txt','w').write(l)
····ºÇºÇ£¬Unicode¾ÍÏñÒ»¸öƽ̨£¬°Ñ´ó¼Ò¶¼Í³Ò»µ½Ò»ÆðÁË¡£µ«ÊÇÓеãÀ˷ѿռ䰡£¬ËùÒÔ´ó¼Ò¾ö¶¨×Ô¼ºµ½ÁËͳһƽ̨֮ºóÔÙ¶¨ÖÆÒ»ÏÂÒ²²»´í¡£
---˳±ãÍÆ¼öÒ»¸ö¼ì²â±àÂëÀàÐ͵ĺö«Î÷codedet£ºhttp://chardet.feedparser.org/
    
     
	
	
    
    
	Ïà¹ØÎĵµ£º
        
    
       PythonÊÇÒ»ÖÖÃæÏò¶ÔÏóµÄ½âÊÍÐԵļÆËã»ú³ÌÐòÉè¼ÆÓïÑÔ£¬Ò²ÊÇÒ»ÖÖ¹¦ÄÜÇ¿´ó¶øÍêÉÆµÄͨÓÃÐÍÓïÑÔ£¬ÒѾ¾ßÓÐÊ®¶àÄêµÄ·¢Õ¹ÀúÊ·£¬³ÉÊìÇÒÎȶ¨¡£Python ¾ßÓнű¾ÓïÑÔÖÐ×î·á¸»ºÍÇ¿´óµÄÀà¿â£¬×ãÒÔÖ§³Ö¾ø´ó¶àÊýÈÕ³£Ó¦Óá£Ëü¾ßÓмòµ¥¡¢Ò×ѧ¡¢Ãâ·Ñ¡¢¿ªÔ´¡¢¿ÉÒÆÖ²ÐÔ¡¢½âÊÍÐÔ¡¢ÃæÏò¶ÔÏ󡢿ÉÀ©Õ¹ÐÔ¡¢¿ÉǶÈëÐÔÒÔ¼°·á¸»µÄ¿âµÈÌØÐÔ£¬ ......
	
    
        
    
     Ò»°ã°²×°µÄ¶¼ÊÇPython22°æ£¬wincvs1.3ÐèÒªpython2.1°æ±¾¼°ÒÔÉÏ¡£µ«ÊÇÆô¶¯¹ý³ÌÕÒ²»µ½£¬ÍøÉÏËÑË÷µÄ·½·¨²»´óÊÊÓá£×îºó¾¹ýÊÔÑé·¢ÏÖ£¬°ÑPython22°²×°Â·¾¶ÏµÄpython22.dll¿½±´µ½wincvsµÄ°²×°Ä¿Â¼Ï¡£ÔËÐÐwincvs£¬ok¡£¾õµÃÕâ¸ö·½·¨ºÃÓõĸø¶¥Ï£¡ ......
	
    
        
    
    
Chapter 1
Python and XML
Python and XML are two very different animals, each with a rich
history. Python is a full-scale programming language that has grown
from scripting world roots in a very organic way, through the vision
and guidance of Python's inventor, Guido van Rossum. Guido continue ......
	
    
        
    
    ÖÐÎļò½é
¡¡¡¡Python (·¢Òô:[ 'paiθ(?)n; (US) 'paiθ?n ]n.òþÉߣ¬¾ÞÉß )£¬ÊÇÒ»ÖÖÃæÏò¶ÔÏóµÄ½âÊÍÐԵļÆËã»ú³ÌÐòÉè¼ÆÓïÑÔ£¬Ò²ÊÇÒ»ÖÖ¹¦ÄÜÇ¿´ó¶øÍêÉÆµÄͨÓÃÐÍÓïÑÔ£¬ÒѾ¾ßÓÐÊ®¶àÄêµÄ·¢Õ¹ÀúÊ·£¬³ÉÊìÇÒÎȶ¨¡£Python ¾ßÓнű¾ÓïÑÔÖÐ×î·á¸»ºÍÇ¿´óµÄÀà¿â£¬×ãÒÔÖ§³Ö¾ø´ó¶àÊýÈÕ³£Ó¦Óá£ËüµÄÃû×ÖÀ´Ô´ÓÚÒ»¸öϲ¾ç,Ò²Ðí×î³õÉè ......
	
    
        
    
     
Ê×ÏÈÒª¸ãÇå³þ£¬×Ö·û´®ÔÚPythonÄÚ²¿µÄ±íʾÊÇunicode±àÂ룬Òò´Ë£¬ÔÚ×ö±àÂëת»»Ê±£¬Í¨³£ÐèÒªÒÔunicode×÷ΪÖмä±àÂ룬¼´ÏȽ«ÆäËû±àÂëµÄ×Ö·û´®½âÂ루decode£©³Éunicode£¬ÔÙ´Óunicode±àÂ루encode£©³ÉÁíÒ»ÖÖ±àÂë¡£ 
decodeµÄ×÷ÓÃÊǽ«ÆäËû±àÂëµÄ×Ö·û´®×ª»»³Éunicode±àÂ룬Èçstr1.decode('gb2312')£¬±íʾ½«gb2312±àÂëµÄ×Ö· ......