JEditorPane中html文档中文乱码解决

2024-03-30 02:29•html•阅读 4453

Culturally dependent information in some documents is handled through a mechanism called character encoding. Character encoding is an unambiguous mapping of the members of a character set (letters, ideographs, digits, symbols, or control functions) to specific numeric code values. It represents the way the file is stored. Example character encodings are ISO-8859-1, ISO-8859-5, Shift-jis, Euc-jp, and UTF-8. When the file is passed to an user agent (JEditorPane) it is converted to the document character set (ISO-10646 aka Unicode).
There are multiple ways to get a character set mapping to happen with JEditorPane.
1. One way is to specify the character set as a parameter of the MIME type. This will be established by a call to the setContentType method. If the content is loaded by thesetPage method the content type will have been set according to the specification of the URL. It the file is loaded directly, the content type would be expected to have been set prior to loading.
2. Another way the character set can be specified is in the document itself. This requires reading the document prior to determining the character set that is desired. To handle this, it is expected that the EditorKit.read operation throw a ChangedCharSetException which will be caught. The read is then restarted with a new Reader that uses the character set specified in the ChangedCharSetException (which is an IOException).
setContentType
```
public final void setContentType(String type)
```
Sets the type of content that this editor handles. This calls getEditorKitForContentType, and then setEditorKit if an editor kit can be successfully located. This is mostly convenience method that can be used as an alternative to calling setEditorKit directly.
If there is a charset definition specified as a parameter of the content type specification, it will be used when loading input streams using the associated EditorKit. For example if the type is specified as text/html; charset=EUC-JP the content will be loaded using the EditorKit registered for text/html and the Reader provided to theEditorKit to load unicode into the document will use the EUC-JP charset for translating to unicode. If the type is not recognized, the content will be loaded using theEditorKit registered for plain text, text/plain.
Parameters:
type - the non-null mime type for the content editing support
Throws:
NullPointerException - if the type parameter is null
See Also:
getContentType()
所以总之就是调用setContentType("text/html;charset=utf-8");即可。

上一篇 »将Android 工程从Windows导入到Ubuntu 下java类中的中文在ADT中乱码
下一篇 »PHP中非常全面的乱码解决方案

JEditorPane中html文档中文乱码解决

setContentType

相关推荐

php mssql 中文各种乱码

notepad++ 中 export to html乱码的解决办法

php数组转json中文乱码解决办法

java解决传参乱码

Java使用Scanner接收中文并输出时出现乱码

struts使用html:file上传文件的时候文件名乱码解决

编译的java工程压缩上传到linux服务器上后，中文的类名显示乱码

springmvc映射html文件以及解决乱码问题