wordתhtmlÈçºÎÇå³ýÈßÓà´úÂë
ÎÒÓм¸Íò¸ö´ÓwordתÀ´µÄhtmlÎļþ£¬µ«ÕâЩhtmlÎļþÓÉdocµÄ100¶àK±ä³ÉÁ˼¸M£¬¼¸Ê®M¡£
ÔÀ´×ªÎªhtmlʱ²úÉúÁË´óÁ¿µÄÈßÓà´úÂ룬ÇëÎÊÓÐʲô·½·¨¿ÉÒÔÇå³ýÕâЩÀ¬»ø¡£
ÐèÒª³ÌÐò´úÂë¡£
¸Õ²Åû·ÖÁË£¬ÏÖÔÚÓÖÓÐÁË£¬¿ÉÒÔ¼Ó·ÖµÄ
/// <summary>
/// ÇåÀíWordÉú³ÉµÄÈßÓàHTML
/// </summary>
/// <param name="html"> </param>
/// <returns> </returns>
public static string CleanWordHtml(string html)
{
StringCollection sc = new StringCollection();
// get rid of unnecessary tag spans (comments and title)
sc.Add(@" <!--(\w|\W)+?-->");
sc.Add(@" <title>(\w|\W)+? </title>");
// Get rid of classes and styles
sc.Add(@"\s?class=\w+");
sc.Add(@"\s+style='[^']+'");
// Get rid of unnecessary tags
//sc.Add(@"
Ïà¹ØÎÊ´ð£º
c# ÓÐûÓÐÓÃÓÚC/SµÄ htmlÎı¾±à¼Æ÷
¾ÍÏñweb½çÃæµÄÎı¾ÄÚÈÝ±à¼Æ÷
up
C/S»¹Òª±àÒëÆ÷¸ÉÂï°¡
ÓÖ²»ÓÿØÖÆÑùʽ
Ã²ËÆÃ»ÓÐ°É Èç¹ûÓÐÁË֪ͨÏÂÎÒ ÎÒÒ²Òªliujintaohfbb@163.comÎÒµÄÓÊÏä ......
ÓÃjavascript
ÈçºÎ¸´ÖÆÐÅÏ¢µ½¼ôÌù°å¡¾²»´øHTML±ê¼ÇµÄ¡¿
ºÜ³£¼ûµÄÒ»¸öÀý×Ó£¬±ÈÈçÎÒÔÚÂÛ̳·¢Ìû×ÓµÄʱºò£¬ÎÒÔÚ±à¼Æ÷Àï±à¼ºÃÒ»¶ÎÎÄ×ÖÖ®ºó£¬µãÌá½»¡£
ÕâʱºòJS×Ô¶¯°ïÎÒ¸´ÖÆÁË¡£
Èç¹ûÍòÒ»³ö´í£¬ÎÒ»¹¿ÉÒÔ ......
ÈçÌâËùʾ£¬´¦ÓÚijÖÖÐèÒª£¬ÐèÒª£¨ÎÞ·¨¸Ä±ä£©ÔÚhtml×îÍ·¶Ë¼ÓÉÏÒ»¶ÎJavaScript´úÂë¡£¿ÉÊǼÓÉÏÖ®ºó¾ÍÓ°ÏìÁËÒ³ÃæµÄÏÔʾ£¬ÓÐûÓÐÄÄÖÖ°ì·¨¿ÉÒÔ±ÜÃâÕâÖÖÇé¿öµÄ
ÈçÏÂËùʾ£¬ÔÀ´µÄhtmlHTML code:
<!DOCTYPE html P ......
ÎÒÒªµ÷ÊԽű¾ÓïÑÔ ÔÚVS2008ÀïHTMLµÄ½Å±¾²»Äܵ÷ÊÔ
ÔõôÄܰÑhTMLת»»³Éasp.netÎļþ
Áí´æÎªaspxÎļþ...
ºÇºÇ
ÎÒ˵µÄÓе㲻¶ÔÁË
ÄÇÀïÓÐÍâÃæÒýÈëµÄJSÎļþ HTML Ö±½Óµ÷ÓÃÄÇÀïµÄº¯Êý¿ÉÊÇ×°³ÉASPXºó¾ÍÕÒ²»µ½Ä ......
ÎÒÓÃWebBrowserÔØÈëÒ»¸öÍøÒ³
È»ºó¶ÁÈ¡±£³Öhtmlµ½Îļþ
·¢ÏÖºÍʵ¼ÊµÄ²î±ðºÃ´ó¡£ºÜ¶àλÖö¼³öÏÖÂÒÂë
ÔÙieÖб£´æ³öÀ´µÄ¾ÍûÎÊÌâ
²»ÖªµÀÔõô»ØÊÂ
Function getWebHtml(browser As Web ......