     * Topic: native2ascii usage [repost]
     Posted by: zhu_ruixian

    native2ascii is a handy tool for solving internationalization problems. The command format is as follows:

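    native2ascii - Native-to-ASCII Converter
    Converts a file containing native-encoded characters (characters that are neither Latin-1 nor Unicode) into a file containing only Latin-1 and/or Unicode-escaped characters.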

    native2ascii [options] [inputfile [outputfile]]

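    The Java compiler and other Java tools can only process files that contain Latin-1 and/or Unicode-escaped (\udddd notation) characters. native2ascii converts files written in any other encoding into files that contain only Latin-1 and/or Unicode-escaped characters.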
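The forward conversion can be illustrated with a minimal Java sketch. It is not the tool's actual implementation: the class name `EscapeSketch` and the method `toAscii` are hypothetical, and the sketch assumes that every character above U+007F is written as a lowercase four-digit escape while ASCII passes through unchanged.

```java
public class EscapeSketch {
    // Assumption: characters above U+007F become a backslash-u escape with
    // four lowercase hex digits; everything else is copied unchanged.
    static String toAscii(String text) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c > 0x7F) {
                out.append(String.format("\\u%04x", (int) c));
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // The literal below holds two Chinese characters, written as escapes
        // so that this source file itself stays pure ASCII.
        System.out.println(toAscii("greeting=\u4f60\u597d"));
    }
}
```

A characters-above-U+007F rule is the simplest one that always yields an ASCII-only file; characters outside the Basic Multilingual Plane come out as two escapes, one per UTF-16 code unit.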

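    Options
    -reverse
    Performs the reverse operation: converts a file containing Latin-1 and/or Unicode-escaped characters back into a file in the native encoding.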
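What the reverse direction does to the text can be sketched in Java as follows. The names `UnescapeSketch`, `fromAscii`, and `isHex4` are hypothetical, and the sketch makes a simplifying assumption: any backslash followed by `u` and exactly four hex digits is treated as an escape, and anything else is copied as-is. How the real tool treats malformed escapes or doubled backslashes is not covered here.

```java
public class UnescapeSketch {
    // True if s has four hex digits starting at index start.
    static boolean isHex4(String s, int start) {
        if (start + 4 > s.length()) {
            return false;
        }
        for (int k = start; k < start + 4; k++) {
            if (Character.digit(s.charAt(k), 16) < 0) {
                return false;
            }
        }
        return true;
    }

    // Replace each backslash-u escape with the character it denotes.
    static String fromAscii(String text) {
        StringBuilder out = new StringBuilder();
        int i = 0;
        while (i < text.length()) {
            char c = text.charAt(i);
            boolean isEscape = c == '\\'
                    && i + 1 < text.length()
                    && text.charAt(i + 1) == 'u'
                    && isHex4(text, i + 2);
            if (isEscape) {
                out.append((char) Integer.parseInt(text.substring(i + 2, i + 6), 16));
                i += 6; // skip the backslash, the 'u', and four hex digits
            } else {
                out.append(c);
                i++;
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String restored = fromAscii("greeting=\\u4f60\\u597d");
        System.out.println(restored.length()); // 9 ASCII chars + 2 decoded chars
    }
}
```

Writing the restored string to a file in a particular native encoding is a separate step (for example with an `OutputStreamWriter` constructed with that charset), which this sketch leaves out.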

    Encoding table (names accepted by the -encoding option):
    Converter Class        Description
    -------------------------------------------------------------
    8859_1           ISO 8859-1
    8859_2           ISO 8859-2
    8859_3           ISO 8859-3
    8859_4           ISO 8859-4
    8859_5           ISO 8859-5
    8859_6           ISO 8859-6
    8859_7           ISO 8859-7
    8859_8           ISO 8859-8
    8859_9           ISO 8859-9
    Big5             Big5, Traditional Chinese
    CNS11643         CNS 11643, Traditional Chinese
    Cp037            USA, Canada (Bilingual, French), Netherlands, Portugal, Brazil, Australia
    Cp1006           IBM AIX Pakistan (Urdu)
    Cp1025           IBM Multilingual Cyrillic: Bulgaria, Bosnia, Herzegovina, Macedonia (FYR)
    Cp1026           IBM Latin-5, Turkey
    Cp1046           IBM Open Edition US EBCDIC
    Cp1097           IBM Iran(Farsi)/Persian
    Cp1098           IBM Iran(Farsi)/Persian (PC)
    Cp1112           IBM Latvia, Lithuania
    Cp1122           IBM Estonia
    Cp1123           IBM Ukraine
    Cp1124           IBM AIX Ukraine
    Cp1125           IBM Ukraine (PC)
    Cp1250           Windows Eastern European
    Cp1251           Windows Cyrillic
    Cp1252           Windows Latin-1
    Cp1253           Windows Greek
    Cp1254           Windows Turkish
    Cp1255           Windows Hebrew
    Cp1256           Windows Arabic
    Cp1257           Windows Baltic
    Cp1258           Windows Vietnamese
    Cp1381           IBM OS/2, DOS People's Republic of China (PRC)
    Cp1383           IBM AIX People's Republic of China (PRC)
    Cp273            IBM Austria, Germany
    Cp277            IBM Denmark, Norway
    Cp278            IBM Finland, Sweden
    Cp280            IBM Italy
    Cp284            IBM Catalan/Spain, Spanish Latin America
    Cp285            IBM United Kingdom, Ireland
    Cp297            IBM France
    Cp33722          IBM-eucJP - Japanese (superset of 5050)
    Cp420            IBM Arabic
    Cp424            IBM Hebrew
    Cp437            MS-DOS United States, Australia, New Zealand, South Africa
    Cp500            EBCDIC 500V1
    Cp737            PC Greek
    Cp775            PC Baltic
    Cp838            IBM Thailand extended SBCS
    Cp850            MS-DOS Latin-1
    Cp852            MS-DOS Latin-2
    Cp855            IBM Cyrillic
    Cp857            IBM Turkish
    Cp860            MS-DOS Portuguese
    Cp861            MS-DOS Icelandic
    Cp862            PC Hebrew
    Cp863            MS-DOS Canadian French
    Cp864            PC Arabic
    Cp865            MS-DOS Nordic
    Cp866            MS-DOS Russian
    Cp868            MS-DOS Pakistan
    Cp869            IBM Modern Greek
    Cp870            IBM Multilingual Latin-2
    Cp871            IBM Iceland
    Cp874            IBM Thai
    Cp875            IBM Greek
    Cp918            IBM Pakistan(Urdu)
    Cp921            IBM Latvia, Lithuania (AIX, DOS)
    Cp922            IBM Estonia (AIX, DOS)
    Cp930            Japanese Katakana-Kanji mixed with 4370 UDC, superset of 5026
    Cp933            Korean Mixed with 1880 UDC, superset of 5029
    Cp935            Simplified Chinese Host mixed with 1880 UDC, superset of 5031
    Cp937            Traditional Chinese Host mixed with 6204 UDC, superset of 5033
    Cp939            Japanese Latin Kanji mixed with 4370 UDC, superset of 5035
    Cp942            Japanese (OS/2) superset of 932
    Cp948            OS/2 Chinese (Taiwan) superset of 938
    Cp949            PC Korean
    Cp950            PC Chinese (Hong Kong, Taiwan)
    Cp964            AIX Chinese (Taiwan)
    Cp970            AIX Korean
    EUCJIS           JIS, EUC Encoding, Japanese
    GB2312           GB2312, EUC encoding, Simplified Chinese
    GBK              GBK, Simplified Chinese
    ISO2022CN        ISO 2022 CN, Chinese
    ISO2022CN_CNS    CNS 11643 in ISO-2022-CN form, T. Chinese
    ISO2022CN_GB     GB 2312 in ISO-2022-CN form, S. Chinese
    ISO2022KR        ISO 2022 KR, Korean
    JIS              JIS, Japanese
    JIS0208          JIS 0208, Japanese
    KOI8_R           KOI8-R, Russian
    KSC5601          KS C 5601, Korean
    MS874            Windows Thai
    MacArabic        Macintosh Arabic
    MacCentralEurope Macintosh Latin-2
    MacCroatian      Macintosh Croatian
    MacCyrillic      Macintosh Cyrillic
    MacDingbat       Macintosh Dingbat
    MacGreek         Macintosh Greek
    MacHebrew        Macintosh Hebrew
    MacIceland       Macintosh Iceland
    MacRoman         Macintosh Roman
    MacRomania       Macintosh Romania
    MacSymbol        Macintosh Symbol
    MacThai          Macintosh Thai
    MacTurkish       Macintosh Turkish
    MacUkraine       Macintosh Ukraine
    SJIS             Shift-JIS, Japanese
    UTF8             UTF-8
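Many of the names in the table are historical converter names, and whether a given Java runtime still recognizes one depends on the charsets installed with it. As a hedged sketch, the standard `java.nio.charset.Charset` API can be used to check a name and see which canonical charset it resolves to; the class `EncodingNameSketch` and the method `canonicalName` are hypothetical helpers, not part of native2ascii.

```java
import java.nio.charset.Charset;

public class EncodingNameSketch {
    // Return the runtime's canonical charset name for an encoding name,
    // or null if the name is malformed or not supported by this runtime.
    static String canonicalName(String name) {
        try {
            return Charset.isSupported(name) ? Charset.forName(name).name() : null;
        } catch (IllegalArgumentException e) {
            // Thrown for syntactically illegal charset names.
            return null;
        }
    }

    public static void main(String[] args) {
        // A few names taken from the table; results vary by runtime.
        String[] names = {"8859_1", "UTF8", "GBK", "SJIS", "Cp1252"};
        for (String name : names) {
            String canonical = canonicalName(name);
            System.out.println(name + " -> "
                    + (canonical == null ? "not supported here" : canonical));
        }
    }
}
```

On a standard runtime, `8859_1` and `UTF8` resolve to `ISO-8859-1` and `UTF-8`; the extended charsets such as `GBK` or `SJIS` may be absent from a trimmed-down runtime.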

    For a concrete usage example, see: http://bbs.xml.org.cn/dispbbs.asp?boardID=22&ID=44779


    Posted: 2007/4/7 21:54:00
     