Ò׽ؽØÍ¼Èí¼þ¡¢µ¥Îļþ¡¢Ãâ°²×°¡¢´¿ÂÌÉ«¡¢½ö160KB

ÖпÆÔº·Ö´Ê¹¤¾ßimdict chinese analyzerѧϰ java·Ö´Ê

ÏÂÔØÁ´½Óhttp://ictclas.org/Down_OpenSrc.asp
¼òµ¥½éÉÜ£º
 imdict-chinese-analyzerÊÇ imdictÖÇÄܴʵäµÄÖÇÄÜÖÐÎÄ·Ö´ÊÄ£¿é£¬×÷Õ߸ßСƽ£¬Ëã·¨»ùÓÚÒþÂí
¶û¿Æ·òÄ£ÐÍ(Hidden Markov Model, HMM)£¬ÊÇÖйú¿ÆÑ§Ôº¼ÆËã¼¼ÊõÑо¿ËùµÄictclasÖÐÎķִʳÌÐò
µÄÖØÐÂʵÏÖ£¨»ùÓÚJava£©£¬¿ÉÒÔÖ±½ÓΪluceneËÑË÷ÒýÇæÌṩÖÐÎÄ·Ö´ÊÖ§³Ö¡£
Ó¦Óãº
ϵ½µÄѹËõ°ü½âѹºó¾ÍÊÇÒ»¸öjava¹¤³Ì£¬eclipseÖ±½Óµ¼Èë¼´¿É£¬µ«ÓÉÓÚÆä¿ª·¢µÄ»·¾³ÊÇUTF8ËùÒÔ
Òª½«eclipseµÄ¹¤×÷¿Õ¼äµÄ±àÂëÒ²ÉèÖÃΪutf8£¬test°üÀïÃæµÄAnalyzerTest¾ÍÊÇÆäÓ÷¨£¬¿´ÁËÒÔºó
¾Í¿ÉÒÔÖ±½ÓÓÃÁË
¹¦ÄÜ£ºÖÐÎÄ·Ö´Ê¡¢Í£Ö¹´Ê¹ýÂË
Óŵ㣺¿ªÔ´£¬·Ö´ÊËٶȿ죬ЧÂʸß
ȱµã£º²»Ö§³Ö×Ô¼ºÌí¼Ó´Ê¿â£¬²»Ö§³Ö´ÊÐÔ±ê×¢£¨¿ª·¢ÈËÔ±×Ô¼ºËµÊÇΪÁËÌá¸ßËÙ¶È£©£¬dataÎļþ¼Ð½ö
×Ô´øÁËÁ½¸ö×ÖµäcoredictºËÐÄ×ֵ䡢bigramdict´Ê¹ØÏµ×ֵ䣬ÕâÊÇÁ½¸ö×îÖØÒªµÄ´Êµä£¬Ã»ÓеØÃûºÍ
ÈËÃûµÄ´Êµä£¬ËùÒÔҪʶ±ðÈËÃûµØÃû±È½ÏÂé·³£¬¾Ý˵ҪÓòã´Îhmm£¬ÏÈ´Ö·ÖÔÚϸ·Ö¡£
ÉîÈëѧϰ£ºÖ÷ÀàÊÇnet.imdict.analysis.chineseÖеÄChineseAnalyzer.javaËü¼Ì³ÐÁËluceneµÄ
AnalyzerÀ࣬ÓÐÁ½¸ö¹¹Ôì·½·¨£ºpublic ChineseAnalyzer()¡¢public ChineseAnalyzer
(Set<String> stopWords)µÚ¶þ¸ö¹¹Ôì·½·¨Ö§³ÖÍ£Óôʣ¬×îÖØÒªµÄÊÇtokenStreamº¯Êý£¬ËüÓÃÁË
SentenceTokenizerºÍnew WordTokenizer£¬Ç°Ò»¸öÊǽ«ÎÄÕ·ֳɾä×Ó£¬ºóÒ»¸öÊǽ«¾ä×ӷֳɵ¥´Ê£¬
µ¥´ÊºÍ¾ä×Ó¶¼ÊÇÓÃLuceneµÄToken£¨´Ê£©µÄÀà´æ´¢µÄ£¬£¨TokenÊÇÒ»¸ö³éÏóÀ࣬TokenStreamÊÇToken
ÀàµÄ×ÓÀ࣬µ«Ò²ÊÇÒ»¸ö³éÏóÀ࣬TokenizerºÍTokenFilterÔòÊÇTokenStreamµÄ¾ßÌåʵÏÖ£¬ËûÃÇʵÏÖ
ÁËTokenStreamµÄnext()·½·¨£¬TokenizerµÄnext·½·¨·µ»ØµÄÊÇԭʼµÄ¡¢ÇзֳöÀ´µÄ´Ê£¬¶ø
TokenFilter·½·¨·µ»ØµÄÊÇÒ»¸ö¾­¹ý¹ýÂ˵ĴÊÌõ£¬ËûÃǽáºÏÆðÀ´ÐγÉLucene·ÖÎöÆ÷µÄºËÐĽṹ£©Èç
Token token = new Token()£¬È»ºóͨ¹ýtoken.reinit(buffer.toString(), tokenStart,
tokenEnd, "sentence");ÖмäÁ½¸ö²ÎÊýÊÇToken´æ´¢µÄ×Ö·û´®µÄÆðֹλÖã¬ÒÔ0¿ªÊ¼¼ÆÊý£¬ÒýÓÃ
tokenÖÐ×Ö·û´®µÄº¯ÊýÊÇtoken.term()£¬ÕæÕýµ÷Ó÷ִʺËÐÄËã·¨µÄWordSegmenterµÄ
segmentSentence·½·¨¶Ô¾ä×Ó½øÐзִʣ¬ÔÚWordTokenizerÀàÖе÷ÓÃËüµÃµ½·Ö´Ê½á¹û¡£ÔÚÍùϲãµÄ´ú
ÂëÎÒ¾Íû¿´ÁË¡£
Á½¸ö¸Ä¶¯£º
£¨1£©ChineseAnalyzerÖ»ÄܶÔÎļþ½øÐзִʣ¬ÈçºÎ¶ÔÒ»¸ö×Ö·û´®½øÐзִʣ¬¸Ä¶¯ÈçÏÂ
/*  TokenStream ts = ca.tokenStream("sentence", new InputStreamRe


Ïà¹ØÎĵµ£º

IBM FileNet Content Java API ¼ò½é

2008 Äê 6 ÔÂ 24 ÈÕ
Ô­ÎĵØÖ·£º http://www.ibm.com/developerworks/cn/data/library/techarticles/dm-0806wangys/
±¾ÎĽéÉÜ IBM FileNet P8 4.0 Platform ÌṩµÄ Content Java API¡£Ê×ÏÈ¶Ô FileNet P8 Content Engine ºÍ API ½øÐиÅÒª½éÉÜ, ²¢ËµÃ÷ÁËһЩ»ù±¾¸ÅÄî£¬ËæºóÏêϸ½éÉÜÁË FileNet Content EngineÌṩµÄ»ùÓÚ EJB ......

javaÖеÄÏÝÚ壬Äã×¢ÒâÁËô£¿

´ð°¸Òþ²ØÁË£¬Ctrl+AÏÔʾ¡£½¨ÒéÏÈ˼¿¼Ò»Ï½á¹û£¬È»ºóÔËÐдúÂëÊÔÑé¡£Ò²ÐíÄã»á»ÐÈ»´óÎò¡£
1¡¢ÕÒÆæÊý£º
view plaincopy to clipboardprint?
public static boolean isOdd(int i){
return i % 2 == 1;
}
public static boolean isOdd(int i){
return i % 2 == 1;
}
ÉÏÃæµÄ·½·¨ÕæµÄÄÜÕÒµ ......

android java±à³Ì×¢ÒâÊÂÏî

1¡¢´´½¨ÁËÒ»¸ö¶ÔÏóºó£º £¨1£©Ã»ÓÐÔÚÊʵ±µÄµØ·½Êͷŵô £¨2£©ÔÚÓ¦¸ÃÊͷŵĵط½Ã»ÓÐ×öÊͷŲÙ×÷ ÀýÈ磺ÏÂÃæÒ»¶Î³ÌÐò£º m_progressDlg = ProgressDialog.show(this, getString(R.string.market),getString(R.string.is_visiting), true);
   new Thread() {
    public void run() { ......

javaÖн«ÖÐÎÄת»»Îªunicode±àÂë

jdk/binĿ¼ÏÂnative2ascii.exeÎļþ¿ÉÒÔÖ±½Ó½«ÖÐÎÄת³Éunicode.
cmd½øµ½binĿ¼Ï£¬ÔËÐÐnative2ascii.exe¡£ÊäÈëÖÐÎĻسµºó¾ÍÉú³ÉÁËunicode±àÂë¡£
Ö±½ÓË«»÷ÔËÒ²ÐУ¬ÎÒÔÚCMDÏÂÔËÐÐÖ÷ÒªÊÇΪÁË·½±ã½«×ª»»ºóµÄ±àÂ뿽±´³öÀ´¡£
Àý£º
cmd
cd C:\Java\jdk1.5.0_17
C:\Java\jdk1.5.0_17>native2ascii
Ó¬Âñ²Ø  &nb ......

java¿ò¼ÜѧϰÌå»á

      ×î½üÔÚѧϰjavaµÄ¿ò¼Ü£¬¼ÓÉîÁ˶ÔjavaһЩÉè¼ÆÄ£Ê½µÄÀí½â£¬ÀýÈçÄ£°å·½·¨Ä£Ê½¡¢²ßÂÔģʽ¡¢´úÀíģʽºÍ¶¯Ì¬´úÀíģʽ£¬·¢ÏÖѧϰÀí½âservletºÍjspºóºÜÈÝÒ×¾Íѧ»áÁËstruts£¬Ñ§Ï°jdbcºó±È½ÏÈÝÒ×Àí½âhibernate£¬±¾ÈË»¹Ã»¿ªÊ¼Ñ§Ï°spring¡£
      ¶Ôstruts¿ò¼ÜµÄÀí½â£ºÖ÷ÒªÊ ......
© 2009 ej38.com All Rights Reserved. ¹ØÓÚE½¡ÍøÁªÏµÎÒÃÇ | Õ¾µãµØÍ¼ | ¸ÓICP±¸09004571ºÅ