Ò׽ؽØͼÈí¼þ¡¢µ¥Îļþ¡¢Ãâ°²×°¡¢´¿ÂÌÉ«¡¢½ö160KB

ËÑË÷֮· c#´ÓhtmlÖÐÌáÈ¡Îı¾

Ö±½Ó·â×°³ÉÒ»¸öÀàµÄ£¬ÓÃÆðÀ´»¹Í¦·½±ãµÄ
using System;
using System.Data;
using System.Configuration;
using System.Web;
using System.Web.Security;
using System.Web.UI;
using System.Web.UI.WebControls;
using System.Web.UI.WebControls.WebParts;
using System.Web.UI.HtmlControls;
using System.Text.RegularExpressions;
/// <summary>
/// HtmlExtract ³éÈ¡htmlÀïÃæµÄÎı¾ÐÅÏ¢
/// </summary>
public class HtmlExtract
{
   
        #region private attributes
        private string _strHtml;
        #endregion
        #region public mehtods
         public HtmlExtract(string inStrHtml)
        { _strHtml = inStrHtml;}
        public string ExtractText()
        {
            string result = _strHtml;
            result = RemoveComment(result);
            result = RemoveScript(result);
            result = RemoveStyle(result);
            result = RemoveTags(result);
            return result.Trim();
        }
        #endregion
     #region private methods
       private string RemoveComment(string input)
{
string result = input;
//remove comment
result = Regex.Replace(result, @"<!--[^-]*-->", string.Empty, RegexOptions.IgnoreCase);
return result;
}
     


Ïà¹ØÎĵµ£º

»ùÓÚSpring¿ò¼Ü·¢ËÍHTML¸ñʽµÄÓʼþ

// ³éÏóÀà package com.albert.email.mime; import org.springframework.mail.javamail.JavaMailSender; public abstract class AbstractMessageSender {
    protected String from;
    protected String to;
    protected String subject; ......

ÄãÕæµÄÁ˽âhtmlÂ𣿠֮script±êÇ©½éÉÜ

Ò»£®ÊôÐÔ£º
src£º¸ÃÊôÐÔÖ¸¶¨ÍⲿJavaScriptÎļþµÄµØÖ·£¬¿ÉÒÔ¿çÓò¡£Èç¹ûÖ¸¶¨Á˸ÃÊôÐÔ£¬ÄÇôscript±êÇ©ÖеÄÄÚÈݾͻᱻºöÂÔ¡£
 
type£ºÖ¸¶¨ËùʹÓõĽű¾ÓïÑÔ£¬¿ÉÒÔ¸²¸ÇĬÈϵĽű¾ÓïÑÔ£¨¼´ÔÚ<META http-equiv="Content-Script-Type" content="type">Öж¨ÒåµÄ½Å±¾ÀàÐÍ
£©¿ª·¢Õß±ØÐëÖ¸¶¨¸ÃÊôÐÔ£¬¸ÃÊôÐÔûÓÐĬÈÏÖµ¡ ......

html±í¸ñ±ê¼Ç

HTMLÓÐ10¸ö±í¸ñÏà¹Ø±êÇ©¡£ÏÂÃæÊÇÒ»¸ö´øÓмò½éµÄÁÐ±í£¬µ«ÊÇÊ×ÏÈ£¬ÎĵµÒª±»ÕýÈ·µÄ¶¨ÒåÔÚHTML 4.01/XHTML 1»òHTML 5ÏÂÃæ:
<caption> ¶¨Òå±í¸ñ±êÌâ(4, 5)
<col> Ϊ±í¸ñµÄÁж¨ÒåÊôÐÔ(4, 5)
<colgroup> ¶¨Òå±í¸ñÁеķÖ×é(4, 5)
<table> ¶¨Òå±í¸ñ(4, 5)
<tbody> ¶¨Òå±í¸ñÖ÷Ìå(4, 5)
< ......

vim html xml ×Ô¶¯²¹È«(ÔÚclosetag.vim ÉÏ×÷Á˵ãÐÞ¸Ä)

µ±ÊäÈë ¡·Ê±×Ô¶¯²¹È« µ±ÊäÈ롶/ʱ×Ô¶¯²¹È«
“£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½£½
" File: closetag.vim
" Summary: Functions and mappings to close open HTML/XML tags
" Uses: <C-_> -- close matching open tag
" Author: Steven Mueller <di ......
© 2009 ej38.com All Rights Reserved. ¹ØÓÚE½¡ÍøÁªÏµÎÒÃÇ | Õ¾µãµØͼ | ¸ÓICP±¸09004571ºÅ