|
Java EE 5 SDK 深圳电信培训中心.徐海蛟老师. |
||||||||
上一个类 下一个类 | 框架 无框架 | ||||||||
摘要: 嵌套 | 字段 | 构造器 | 方法 | 详细信息: 字段 | 构造器 | 方法 |
javax.mail.internet Class MimeUtility
java.lang.Object javax.mail.internet.MimeUtility
public class MimeUtility
- extends Object
This is a utility class that provides various MIME related functionality.
There are a set of methods to encode and decode MIME headers as per RFC 2047. A brief description on handling such headers is given below:
RFC 822 mail headers must contain only US-ASCII characters. Headers that contain non US-ASCII characters must be encoded so that they contain only US-ASCII characters. Basically, this process involves using either BASE64 or QP to encode certain characters. RFC 2047 describes this in detail.
In Java, Strings contain (16 bit) Unicode characters. ASCII is a subset of Unicode (and occupies the range 0 - 127). A String that contains only ASCII characters is already mail-safe. If the String contains non US-ASCII characters, it must be encoded. An additional complexity in this step is that since Unicode is not yet a widely used charset, one might want to first charset-encode the String into another charset and then do the transfer-encoding.
Note that to get the actual bytes of a mail-safe String (say, for sending over SMTP), one must do
byte[] bytes = string.getBytes("iso-8859-1");
The setHeader
and addHeader
methods
on MimeMessage and MimeBodyPart assume that the given header values
are Unicode strings that contain only US-ASCII characters. Hence
the callers of those methods must insure that the values they pass
do not contain non US-ASCII characters. The methods in this class
help do this.
The getHeader
family of methods on MimeMessage and
MimeBodyPart return the raw header value. These might be encoded
as per RFC 2047, and if so, must be decoded into Unicode Strings.
The methods in this class help to do this.
Several System properties control strict conformance to the MIME spec. Note that these are not session properties but must be set globally as System properties.
The mail.mime.decodetext.strict
property controls
decoding of MIME encoded words. The MIME spec requires that encoded
words start at the beginning of a whitespace separated word. Some
mailers incorrectly include encoded words in the middle of a word.
If the mail.mime.decodetext.strict
System property is
set to "false"
, an attempt will be made to decode these
illegal encoded words. The default is true.
The mail.mime.encodeeol.strict
property controls the
choice of Content-Transfer-Encoding for MIME parts that are not of
type "text". Often such parts will contain textual data for which
an encoding that allows normal end of line conventions is appropriate.
In rare cases, such a part will appear to contain entirely textual
data, but will require an encoding that preserves CR and LF characters
without change. If the mail.mime.encodeeol.strict
System property is set to "true"
, such an encoding will
be used when necessary. The default is false.
In addition, the mail.mime.charset
System property can
be used to specify the default MIME charset to use for encoded words
and text parts that don't otherwise specify a charset. Normally, the
default MIME charset is derived from the default Java charset, as
specified in the file.encoding
System property. Most
applications will have no need to explicitly set the default MIME
charset. In cases where the default MIME charset to be used for
mail messages is different than the charset used for files stored on
the system, this property should be set.
- 版本:
- 1.60, 07/05/15
- 作者:
- John Mani, Bill Shannon
字段摘要 | |
---|---|
static int |
ALL
|
方法摘要 | |
---|---|
static InputStream |
decode(InputStream is,
String encoding)
Decode the given input stream. |
static String |
decodeText(String etext)
Decode "unstructured" headers, that is, headers that are defined as '*text' as per RFC 822. |
static String |
decodeWord(String eword)
The string is parsed using the rules in RFC 2047 for parsing an "encoded-word". |
static OutputStream |
encode(OutputStream os,
String encoding)
Wrap an encoder around the given output stream. |
static OutputStream |
encode(OutputStream os,
String encoding,
String filename)
Wrap an encoder around the given output stream. |
static String |
encodeText(String text)
Encode a RFC 822 "text" token into mail-safe form as per RFC 2047. |
static String |
encodeText(String text,
String charset,
String encoding)
Encode a RFC 822 "text" token into mail-safe form as per RFC 2047. |
static String |
encodeWord(String word)
Encode a RFC 822 "word" token into mail-safe form as per RFC 2047. |
static String |
encodeWord(String word,
String charset,
String encoding)
Encode a RFC 822 "word" token into mail-safe form as per RFC 2047. |
static String |
fold(int used,
String s)
Fold a string at linear whitespace so that each line is no longer than 76 characters, if possible. |
static String |
getDefaultJavaCharset()
Get the default charset corresponding to the system's current default locale. |
static String |
getEncoding(DataHandler dh)
Same as getEncoding(DataSource) except that instead
of reading the data from an InputStream it uses the
writeTo method to examine the data. |
static String |
getEncoding(DataSource ds)
Get the content-transfer-encoding that should be applied to the input stream of this datasource, to make it mailsafe. |
static String |
javaCharset(String charset)
Convert a MIME charset name into a valid Java charset name. |
static String |
mimeCharset(String charset)
Convert a java charset into its MIME charset name. |
static String |
quote(String word,
String specials)
A utility method to quote a word, if the word contains any characters from the specified 'specials' list. |
static String |
unfold(String s)
Unfold a folded header. |
类方法继承 java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
字段详细信息 |
---|
ALL
public static final int ALL
- 另请参见:
- 常量字段
方法详细信息 |
---|
getEncoding
public static String getEncoding(DataSource ds)
- Get the content-transfer-encoding that should be applied
to the input stream of this datasource, to make it mailsafe.
The algorithm used here is:
- If the primary type of this datasource is "text" and if all the bytes in its input stream are US-ASCII, then the encoding is "7bit". If more than half of the bytes are non-US-ASCII, then the encoding is "base64". If less than half of the bytes are non-US-ASCII, then the encoding is "quoted-printable".
- If the primary type of this datasource is not "text", then if all the bytes of its input stream are US-ASCII, the encoding is "7bit". If there is even one non-US-ASCII character, the encoding is "base64".
- 参数:
ds
- DataSource- 返回:
- the encoding. This is either "7bit", "quoted-printable" or "base64"
getEncoding
public static String getEncoding(DataHandler dh)
- Same as
getEncoding(DataSource)
except that instead of reading the data from anInputStream
it uses thewriteTo
method to examine the data. This is more efficient in the common case of aDataHandler
created with an object and a MIME type (for example, a "text/plain" String) because all the I/O is done in this thread. In the case requiring anInputStream
theDataHandler
uses a thread, a pair of pipe streams, and thewriteTo
method to produce the data. - 从以下版本开始:
- JavaMail 1.2
decode
public static InputStream decode(InputStream is, String encoding) throws MessagingException
- Decode the given input stream. The Input stream returned is
the decoded input stream. All the encodings defined in RFC 2045
are supported here. They include "base64", "quoted-printable",
"7bit", "8bit", and "binary". In addition, "uuencode" is also
supported.
- 参数:
is
- input streamencoding
- the encoding of the stream.- 返回:
- decoded input stream.
- 抛出异常:
MessagingException
encode
public static OutputStream encode(OutputStream os, String encoding) throws MessagingException
- Wrap an encoder around the given output stream.
All the encodings defined in RFC 2045 are supported here.
They include "base64", "quoted-printable", "7bit", "8bit" and
"binary". In addition, "uuencode" is also supported.
- 参数:
os
- output streamencoding
- the encoding of the stream.- 返回:
- output stream that applies the specified encoding.
- 抛出异常:
MessagingException
encode
public static OutputStream encode(OutputStream os, String encoding, String filename) throws MessagingException
- Wrap an encoder around the given output stream.
All the encodings defined in RFC 2045 are supported here.
They include "base64", "quoted-printable", "7bit", "8bit" and
"binary". In addition, "uuencode" is also supported.
The
filename
parameter is used with the "uuencode" encoding and is included in the encoded output. - 参数:
os
- output streamencoding
- the encoding of the stream.filename
- name for the file being encoded (only used with uuencode)- 返回:
- output stream that applies the specified encoding.
- 抛出异常:
MessagingException
- 从以下版本开始:
- JavaMail 1.2
encodeText
public static String encodeText(String text) throws UnsupportedEncodingException
- Encode a RFC 822 "text" token into mail-safe form as per
RFC 2047.
The given Unicode string is examined for non US-ASCII characters. If the string contains only US-ASCII characters, it is returned as-is. If the string contains non US-ASCII characters, it is first character-encoded using the platform's default charset, then transfer-encoded using either the B or Q encoding. The resulting bytes are then returned as a Unicode string containing only ASCII characters.
Note that this method should be used to encode only "unstructured" RFC 822 headers.
Example of usage:
MimePart part = ... String rawvalue = "FooBar Mailer, Japanese version 1.1" try { // If we know for sure that rawvalue contains only US-ASCII // characters, we can skip the encoding part part.setHeader("X-mailer", MimeUtility.encodeText(rawvalue)); } catch (UnsupportedEncodingException e) { // encoding failure } catch (MessagingException me) { // setHeader() failure }
- 参数:
text
- Unicode string- 返回:
- Unicode string containing only US-ASCII characters
- 抛出异常:
UnsupportedEncodingException
- if the encoding fails
encodeText
public static String encodeText(String text, String charset, String encoding) throws UnsupportedEncodingException
- Encode a RFC 822 "text" token into mail-safe form as per
RFC 2047.
The given Unicode string is examined for non US-ASCII characters. If the string contains only US-ASCII characters, it is returned as-is. If the string contains non US-ASCII characters, it is first character-encoded using the specified charset, then transfer-encoded using either the B or Q encoding. The resulting bytes are then returned as a Unicode string containing only ASCII characters.
Note that this method should be used to encode only "unstructured" RFC 822 headers.
- 参数:
text
- the header valuecharset
- the charset. If this parameter is null, the platform's default chatset is used.encoding
- the encoding to be used. Currently supported values are "B" and "Q". If this parameter is null, then the "Q" encoding is used if most of characters to be encoded are in the ASCII charset, otherwise "B" encoding is used.- 返回:
- Unicode string containing only US-ASCII characters
- 抛出异常:
UnsupportedEncodingException
decodeText
public static String decodeText(String etext) throws UnsupportedEncodingException
- Decode "unstructured" headers, that is, headers that are defined
as '*text' as per RFC 822.
The string is decoded using the algorithm specified in RFC 2047, Section 6.1. If the charset-conversion fails for any sequence, an UnsupportedEncodingException is thrown. If the String is not an RFC 2047 style encoded header, it is returned as-is
Example of usage:
MimePart part = ... String rawvalue = null; String value = null; try { if ((rawvalue = part.getHeader("X-mailer")[0]) != null) value = MimeUtility.decodeText(rawvalue); } catch (UnsupportedEncodingException e) { // Don't care value = rawvalue; } catch (MessagingException me) { } return value;
- 参数:
etext
- the possibly encoded value- 抛出异常:
UnsupportedEncodingException
- if the charset conversion failed.
encodeWord
public static String encodeWord(String word) throws UnsupportedEncodingException
- Encode a RFC 822 "word" token into mail-safe form as per
RFC 2047.
The given Unicode string is examined for non US-ASCII characters. If the string contains only US-ASCII characters, it is returned as-is. If the string contains non US-ASCII characters, it is first character-encoded using the platform's default charset, then transfer-encoded using either the B or Q encoding. The resulting bytes are then returned as a Unicode string containing only ASCII characters.
This method is meant to be used when creating RFC 822 "phrases". The InternetAddress class, for example, uses this to encode it's 'phrase' component.
- 参数:
word
- Unicode string- 返回:
- Array of Unicode strings containing only US-ASCII characters.
- 抛出异常:
UnsupportedEncodingException
- if the encoding fails
encodeWord
public static String encodeWord(String word, String charset, String encoding) throws UnsupportedEncodingException
- Encode a RFC 822 "word" token into mail-safe form as per
RFC 2047.
The given Unicode string is examined for non US-ASCII characters. If the string contains only US-ASCII characters, it is returned as-is. If the string contains non US-ASCII characters, it is first character-encoded using the specified charset, then transfer-encoded using either the B or Q encoding. The resulting bytes are then returned as a Unicode string containing only ASCII characters.
- 参数:
word
- Unicode stringcharset
- the MIME charsetencoding
- the encoding to be used. Currently supported values are "B" and "Q". If this parameter is null, then the "Q" encoding is used if most of characters to be encoded are in the ASCII charset, otherwise "B" encoding is used.- 返回:
- Unicode string containing only US-ASCII characters
- 抛出异常:
UnsupportedEncodingException
- if the encoding fails
decodeWord
public static String decodeWord(String eword) throws ParseException, UnsupportedEncodingException
- The string is parsed using the rules in RFC 2047 for parsing
an "encoded-word". If the parse fails, a ParseException is
thrown. Otherwise, it is transfer-decoded, and then
charset-converted into Unicode. If the charset-conversion
fails, an UnsupportedEncodingException is thrown.
- 参数:
eword
- the encoded value- 抛出异常:
ParseException
- if the string is not an encoded-word as per RFC 2047.UnsupportedEncodingException
- if the charset conversion failed.
quote
public static String quote(String word, String specials)
- A utility method to quote a word, if the word contains any
characters from the specified 'specials' list.
The
HeaderTokenizer
class defines two special sets of delimiters - MIME and RFC 822.This method is typically used during the generation of RFC 822 and MIME header fields.
- 参数:
word
- word to be quotedspecials
- the set of special characters- 返回:
- the possibly quoted word
- 另请参见:
HeaderTokenizer.MIME
,HeaderTokenizer.RFC822
fold
public static String fold(int used, String s)
- Fold a string at linear whitespace so that each line is no longer
than 76 characters, if possible. If there are more than 76
non-whitespace characters consecutively, the string is folded at
the first whitespace after that sequence. The parameter
used
indicates how many characters have been used in the current line; it is usually the length of the header name.Note that line breaks in the string aren't escaped; they probably should be.
- 参数:
used
- characters used in line so fars
- the string to fold- 返回:
- the folded string
- 从以下版本开始:
- JavaMail 1.4
unfold
public static String unfold(String s)
- Unfold a folded header. Any line breaks that aren't escaped and
are followed by whitespace are removed.
- 参数:
s
- the string to unfold- 返回:
- the unfolded string
- 从以下版本开始:
- JavaMail 1.4
javaCharset
public static String javaCharset(String charset)
- Convert a MIME charset name into a valid Java charset name.
- 参数:
charset
- the MIME charset name- 返回:
- the Java charset equivalent. If a suitable mapping is not available, the passed in charset is itself returned.
mimeCharset
public static String mimeCharset(String charset)
- Convert a java charset into its MIME charset name.
Note that a future version of JDK (post 1.2) might provide this functionality, in which case, we may deprecate this method then.
- 参数:
charset
- the JDK charset- 返回:
- the MIME/IANA equivalent. If a mapping is not possible, the passed in charset itself is returned.
- 从以下版本开始:
- JavaMail 1.1
getDefaultJavaCharset
public static String getDefaultJavaCharset()
- Get the default charset corresponding to the system's current
default locale. If the System property
mail.mime.charset
is set, a system charset corresponding to this MIME charset will be returned. - 返回:
- the default charset of the system's default locale, as a Java charset. (NOT a MIME charset)
- 从以下版本开始:
- JavaMail 1.1
|
Java EE 5 SDK 深圳电信培训中心.徐海蛟老师. |
||||||||
上一个类 下一个类 | 框架 无框架 | ||||||||
摘要: 嵌套 | 字段 | 构造器 | 方法 | 详细信息: 字段 | 构造器 | 方法 |
提交错误或意见
版权所有 2007 Sun Microsystems, Inc. 保留所有权利。 请遵守许可证条款。深圳电信培训中心.徐海蛟老师教学参考.