ÖпÆÔº·Ö´Ê¹¤¾ßimdict chinese analyzerѧϰ java·Ö´Ê
ÏÂÔØÁ´½Óhttp://ictclas.org/Down_OpenSrc.asp
¼òµ¥½éÉÜ£º
imdict-chinese-analyzerÊÇ imdictÖÇÄܴʵäµÄÖÇÄÜÖÐÎÄ·Ö´ÊÄ£¿é£¬×÷Õ߸ßСƽ£¬Ëã·¨»ùÓÚÒþÂí
¶û¿Æ·òÄ£ÐÍ(Hidden Markov Model, HMM)£¬ÊÇÖйú¿ÆÑ§Ôº¼ÆËã¼¼ÊõÑо¿ËùµÄictclasÖÐÎķִʳÌÐò
µÄÖØÐÂʵÏÖ£¨»ùÓÚJava£©£¬¿ÉÒÔÖ±½ÓΪluceneËÑË÷ÒýÇæÌṩÖÐÎÄ·Ö´ÊÖ§³Ö¡£
Ó¦Óãº
ϵ½µÄѹËõ°ü½âѹºó¾ÍÊÇÒ»¸öjava¹¤³Ì£¬eclipseÖ±½Óµ¼Èë¼´¿É£¬µ«ÓÉÓÚÆä¿ª·¢µÄ»·¾³ÊÇUTF8ËùÒÔ
Òª½«eclipseµÄ¹¤×÷¿Õ¼äµÄ±àÂëÒ²ÉèÖÃΪutf8£¬test°üÀïÃæµÄAnalyzerTest¾ÍÊÇÆäÓ÷¨£¬¿´ÁËÒÔºó
¾Í¿ÉÒÔÖ±½ÓÓÃÁË
¹¦ÄÜ£ºÖÐÎÄ·Ö´Ê¡¢Í£Ö¹´Ê¹ýÂË
Óŵ㣺¿ªÔ´£¬·Ö´ÊËٶȿ죬ЧÂʸß
ȱµã£º²»Ö§³Ö×Ô¼ºÌí¼Ó´Ê¿â£¬²»Ö§³Ö´ÊÐÔ±ê×¢£¨¿ª·¢ÈËÔ±×Ô¼ºËµÊÇΪÁËÌá¸ßËÙ¶È£©£¬dataÎļþ¼Ð½ö
×Ô´øÁËÁ½¸ö×ÖµäcoredictºËÐÄ×ֵ䡢bigramdict´Ê¹ØÏµ×ֵ䣬ÕâÊÇÁ½¸ö×îÖØÒªµÄ´Êµä£¬Ã»ÓеØÃûºÍ
ÈËÃûµÄ´Êµä£¬ËùÒÔҪʶ±ðÈËÃûµØÃû±È½ÏÂé·³£¬¾Ý˵ҪÓòã´Îhmm£¬ÏÈ´Ö·ÖÔÚϸ·Ö¡£
ÉîÈëѧϰ£ºÖ÷ÀàÊÇnet.imdict.analysis.chineseÖеÄChineseAnalyzer.javaËü¼Ì³ÐÁËluceneµÄ
AnalyzerÀ࣬ÓÐÁ½¸ö¹¹Ôì·½·¨£ºpublic ChineseAnalyzer()¡¢public ChineseAnalyzer
(Set<String> stopWords)µÚ¶þ¸ö¹¹Ôì·½·¨Ö§³ÖÍ£Óôʣ¬×îÖØÒªµÄÊÇtokenStreamº¯Êý£¬ËüÓÃÁË
SentenceTokenizerºÍnew WordTokenizer£¬Ç°Ò»¸öÊǽ«ÎÄÕ·ֳɾä×Ó£¬ºóÒ»¸öÊǽ«¾ä×ӷֳɵ¥´Ê£¬
µ¥´ÊºÍ¾ä×Ó¶¼ÊÇÓÃLuceneµÄToken£¨´Ê£©µÄÀà´æ´¢µÄ£¬£¨TokenÊÇÒ»¸ö³éÏóÀ࣬TokenStreamÊÇToken
ÀàµÄ×ÓÀ࣬µ«Ò²ÊÇÒ»¸ö³éÏóÀ࣬TokenizerºÍTokenFilterÔòÊÇTokenStreamµÄ¾ßÌåʵÏÖ£¬ËûÃÇʵÏÖ
ÁËTokenStreamµÄnext()·½·¨£¬TokenizerµÄnext·½·¨·µ»ØµÄÊÇÔʼµÄ¡¢ÇзֳöÀ´µÄ´Ê£¬¶ø
TokenFilter·½·¨·µ»ØµÄÊÇÒ»¸ö¾¹ý¹ýÂ˵ĴÊÌõ£¬ËûÃǽáºÏÆðÀ´ÐγÉLucene·ÖÎöÆ÷µÄºËÐĽṹ£©Èç
Token token = new Token()£¬È»ºóͨ¹ýtoken.reinit(buffer.toString(), tokenStart,
tokenEnd, "sentence");ÖмäÁ½¸ö²ÎÊýÊÇToken´æ´¢µÄ×Ö·û´®µÄÆðֹλÖã¬ÒÔ0¿ªÊ¼¼ÆÊý£¬ÒýÓÃ
tokenÖÐ×Ö·û´®µÄº¯ÊýÊÇtoken.term()£¬ÕæÕýµ÷Ó÷ִʺËÐÄËã·¨µÄWordSegmenterµÄ
segmentSentence·½·¨¶Ô¾ä×Ó½øÐзִʣ¬ÔÚWordTokenizerÀàÖе÷ÓÃËüµÃµ½·Ö´Ê½á¹û¡£ÔÚÍùϲãµÄ´ú
ÂëÎÒ¾Íû¿´ÁË¡£
Á½¸ö¸Ä¶¯£º
£¨1£©ChineseAnalyzerÖ»ÄܶÔÎļþ½øÐзִʣ¬ÈçºÎ¶ÔÒ»¸ö×Ö·û´®½øÐзִʣ¬¸Ä¶¯ÈçÏÂ
/* TokenStream ts = ca.tokenStream("sentence", new InputStreamRe
Ïà¹ØÎĵµ£º
jdk/binĿ¼ÏÂnative2ascii.exeÎļþ¿ÉÒÔÖ±½Ó½«ÖÐÎÄת³Éunicode.
cmd½øµ½binĿ¼Ï£¬ÔËÐÐnative2ascii.exe¡£ÊäÈëÖÐÎĻسµºó¾ÍÉú³ÉÁËunicode±àÂë¡£
Ö±½ÓË«»÷ÔËÒ²ÐУ¬ÎÒÔÚCMDÏÂÔËÐÐÖ÷ÒªÊÇΪÁË·½±ã½«×ª»»ºóµÄ±àÂ뿽±´³öÀ´¡£
Àý£º
cmd
cd C:\Java\jdk1.5.0_17
C:\Java\jdk1.5.0_17>native2ascii
Ó¬Âñ²Ø &nb ......
ÓйØJava¶ÔÏóµÄÐòÁл¯ºÍ·´ÐòÁл¯Ò²ËãÊÇJava»ù´¡µÄÒ»²¿·Ö£¬ÏÂÃæ¶ÔJavaÐòÁл¯µÄ»úÖÆºÍÔÀí½øÐÐһЩ½éÉÜ¡£
¡¡¡¡JavaÐòÁл¯Ë㷨͸Îö
¡¡¡¡Serialization£¨ÐòÁл¯£©ÊÇÒ»ÖÖ½«¶ÔÏóÒÔÒ»Á¬´®µÄ×Ö½ÚÃèÊöµÄ¹ý³Ì£»·´ÐòÁл¯deserializationÊÇÒ»ÖÖ½«ÕâЩ×Ö½ÚÖØ½¨³ÉÒ»¸ö¶ÔÏóµÄ¹ý³Ì¡£JavaÐòÁл¯APIÌṩһÖÖ´¦Àí¶ÔÏóÐòÁл¯µÄ±ê×¼»úÖÆ¡£ÔÚÕâÀ ......
JAVAºËÐļ¼Êõ¹Ûºó¸Ð
ÕâÖÜ´ó¼Ò¶¼»ù±¾ÉÏÂòÁËÒ»±¾¡¶JAVAºËÐļ¼Êõ¡·À´¿´£¬ËäÈ»ÎÒµÄÊéÏÂÖܲÅÄÜÄõ½£¬µ«ÊÇÎÒ»¹ÊÇ·ÁËϱðÈ˵쬴óÖÂÁ˽âÁËÒ»ÏÂÀïÃæµÄÄÚÈÝ¡£ÒÔϾÍÊÇÎÒ´Ö²ÚµÄÕûÀí¡£
JAVA²¢²»Ö»ÊÇÒ»ÖÖÓïÑÔ£¬¶øÊÇÒ»¸öÍêÕûµÄƽ̨£¬ÓÐÒ»¸öÅÓ´óµÄ¿â£¬ÆäÖаüº¬ºÜ¶à¿ÉÒÔÖØÓõĴúÂëºÍÒ»¸öÌṩÖîÈ簲ȫÐÔ£¬¿ç²Ù×÷ϵͳµÄ¿ÉÒÆ ......
×î½üÔÚѧϰjavaµÄ¿ò¼Ü£¬¼ÓÉîÁ˶ÔjavaһЩÉè¼ÆÄ£Ê½µÄÀí½â£¬ÀýÈçÄ£°å·½·¨Ä£Ê½¡¢²ßÂÔģʽ¡¢´úÀíģʽºÍ¶¯Ì¬´úÀíģʽ£¬·¢ÏÖѧϰÀí½âservletºÍjspºóºÜÈÝÒ×¾Íѧ»áÁËstruts£¬Ñ§Ï°jdbcºó±È½ÏÈÝÒ×Àí½âhibernate£¬±¾ÈË»¹Ã»¿ªÊ¼Ñ§Ï°spring¡£
¶Ôstruts¿ò¼ÜµÄÀí½â£ºÖ÷ÒªÊ ......
ÓÃspyºÍmemcached for javaÁ½ÖÖ·½Ê½¶Ômemcache½øÐвÙ×÷
Ò»¡¢spy
package com.test.memcache;
import java.net.InetSocketAddress;
import java.util.concurrent.Future;
import net.spy.memcached.MemcachedClient;
/**
* ±¾ÀàÓõİüÊÇmemcached-2.4.1.jar
* ÏÂÔØµØÖ·£º http://code.googl ......