wordתhtmlÈçºÎÇå³ýÈßÓà´úÂë
ÎÒÓм¸Íò¸ö´ÓwordתÀ´µÄhtmlÎļþ£¬µ«ÕâЩhtmlÎļþÓÉdocµÄ100¶àK±ä³ÉÁ˼¸M£¬¼¸Ê®M¡£
ÔÀ´×ªÎªhtmlʱ²úÉúÁË´óÁ¿µÄÈßÓà´úÂ룬ÇëÎÊÓÐʲô·½·¨¿ÉÒÔÇå³ýÕâЩÀ¬»ø¡£
ÐèÒª³ÌÐò´úÂë¡£
¸Õ²Åû·ÖÁË£¬ÏÖÔÚÓÖÓÐÁË£¬¿ÉÒÔ¼Ó·ÖµÄ
/// <summary>
/// ÇåÀíWordÉú³ÉµÄÈßÓàHTML
/// </summary>
/// <param name="html"> </param>
/// <returns> </returns>
public static string CleanWordHtml(string html)
{
StringCollection sc = new StringCollection();
// get rid of unnecessary tag spans (comments and title)
sc.Add(@" <!--(\w|\W)+?-->");
sc.Add(@" <title>(\w|\W)+? </title>");
// Get rid of classes and styles
sc.Add(@"\s?class=\w+");
sc.Add(@"\s+style='[^']+'");
// Get rid of unnecessary tags
//sc.Add(@"
Ïà¹ØÎÊ´ð£º
ÊÖ»úÄÜ´ò¿ª.htmlµÄÍøÕ¾,Ϊʲô»¹Òª×öwapÍøÕ¾ÁË?,,,ÊÖ»úä¯ÀÀwapÍøÕ¾ÓÐʲôºÃ´¦
ÎÒÃǹ«Ë¾×öµÄwap¾ÍÊÇhtmlµÄ¡£
¹Ø×¢
ºÜ¶àµÍ¶ËµÄÊÖ»ú¶¼»¹ÊÇÖ»ÄÜ¿´wml¸ñʽµÄÀ²£¬wml±¾À´¾ÍÊÇרÃÅÕë¶ÔÊÖ»úÖÆ¶¨µÄÒ»Ì×Ò³ÃæÏÔʾÓïÑÔÀ²£ ......
×Ö·û´®×ª»¯Îª HTML ʵÌ壿 Ôõôд×î¼òµ¥£¿
±ÈÈç°Ñ¡°ÄãºÃ¡±
Êä³öΪ£º
你 好
²»ÐÐ
C# code:
string str = "ÄãºÃ";
......
½«.net´úÂëÇгÉHTMLÔ´ÂëµÄ·½·¨£¿Çë¶à¶àÖ¸½Ì£¡
ÕâÊÇÒª×öʲô Ö±½ÓÔËÐкó²é¿´Ô´Îļþ?
ÔËÐУ¡²é¿´Ô´´úÂ룡
ÄãÏëÒª¸ÉÂ
¾ÍÊÇÔËÐкóÒ³ÃæµÄÔ´ÂëΪHTMLµÄ´úÂë ¶ø²»ÊÇ¡£netÔ´Â룡
¿ÉÒÔÓÃjscriptºô½Ð³öht ......
Ôõô²Å¿ÉÒÔÈ¥µô <html:file>ÖеÄÄǸöÊäÈë¿ò£¬ÈÃÒ³ÃæÖ»ÏÔʾÄǸöä¯ÀÀ°´Å¥£®»òÊÇÓÃÒ»¸öbuttonÀ´×ö£¬µ±µã»÷Ò»¸öbutton¾Í¿ÉÒÔä¯ÀÀ±¾µØµÄÎļþ¼Ð£®
ÎÒÊÇдÁËÒ»¸öbuttonºÍÒ»¸ö <html:file>±êÇ©
È»ºó ......
ÎÒÓÃWebBrowserÔØÈëÒ»¸öÍøÒ³
È»ºó¶ÁÈ¡±£³Öhtmlµ½Îļþ
·¢ÏÖºÍʵ¼ÊµÄ²î±ðºÃ´ó¡£ºÜ¶àλÖö¼³öÏÖÂÒÂë
ÔÙieÖб£´æ³öÀ´µÄ¾ÍûÎÊÌâ
²»ÖªµÀÔõô»ØÊÂ
Function getWebHtml(browser As Web ......