请教ggao 的写法

新手入门、经验交流

Moderator: cre

Forum rules
  • 软件自带帮助是最好的入门教程,请仔细阅读;在您掌握最基本知识前,我们没法帮助您
  • 提问前请先查阅软件帮助文件和论坛相关讨论
  • 查阅软件帮助和论坛相关讨论即可解决的问题帖将直接删除,以免浪费他人时间
  • 请学会如何提问,尽量将问题描述清楚,不明确的提问将被直接删除
Post Reply
sixsheeps
Posts: 30
Joined: Jan 17 2008, 14:59

请教ggao 的写法

Post by sixsheeps »

SCRIPT language=javascript src="/ggao/all_TL.js"></SCRIPT>
SCRIPT language=javascript src="/ggao/js_index_left.js"></SCRIPT>
SCRIPT language=javascript src="/Ggao/Sohu_Ggao.js"></SCRIPT>
SCRIPT language=javascript src="/Ggao/wenzi.js"></SCRIPT>
SCRIPT language=javascript src="http://www.crsky.com/Ggao/wenzi_down.js"></SCRIPT>
SCRIPT language=javascript src="/ggao/down_468.js"></SCRIPT>
script language="javascript" src="/Ggao/index-205.js"></script>
Script language=javascript src="/ggao/left.js"></Script>
SCRIPT src="/ggao/lefts.js"></SCRIPT>
script src="/ggao/downd_a.js"></script>
script src="/ggao/index_TL.js"></script>
script src=/ggao/index-buttom.js>

iframe frameborder=0 src=http://www.crsky.com/ggao/all_buttom.htm width=760 height=60 scrolling=no></iframe>
iframe frameborder=0 src=/ggao/buttom4.htm width=180 height=120 scrolling=no></iframe>
iframe frameborder=0 src=http://www.crsky.com/ggao/all_buttom.htm width=760 height=60 scrolling=no></iframe>
iframe frameborder=0 src=http://www.crsky.com/ggao/l6_1.htm width=530 height=60 scrolling=no></iframe>
iframe frameborder=0 src=/ggao/L6.htm width=530 height=60 scrolling=no></iframe>
IFRAME src="/ggao/down.htm" frameBorder=0 width=530 scrolling=no height=60></IFRAME>



这是最近找出来的霏凡软件的页面广告代码。发现大部分的代码都是:1 在script 和 iframe里面;2 src 属性内容里面有/ggao/;

我记得有一个语句是取属性值的。但是怎么用可以让这个语句有判断的功能呢?
User avatar
phoenix
Site Admin
Posts: 529
Joined: Dec 29 2007, 16:27

Re: 请教ggao 的写法

Post by phoenix »

匹配属性值可以用$AV()命令,以下代码将匹配你给出的例子:

Code: Select all

(script|iframe)\1\s*src=$AV(*/ggao/*)*</\1>
相关讨论:http://forum.proxomitron.cn/viewtopic.php?f=3&t=52
Ray4
Moderator
Posts: 65
Joined: Dec 30 2007, 09:37

Re: 请教ggao 的写法

Post by Ray4 »

感觉crsky的广告直接用url killfile.txt过滤的效果也可以让人接受,现在多数过滤我都是用这个的
sixsheeps
Posts: 30
Joined: Jan 17 2008, 14:59

Re: 请教ggao 的写法

Post by sixsheeps »

url killfile.txt???

能否说的明确一些呢。是不是sidki规则里面的呢?我是把全部规则重写的。对于一些通用的广告过滤,能不能分享一下成果?呵呵……也好,研究学习
imsheng
Posts: 37
Joined: Jan 08 2008, 13:24

Re: 请教ggao 的写法

Post by imsheng »

霏凡的广告系统写得还不错的。

我专门去研究过。他把广告的代码变量写到好几个js里面。如果你过滤了某个js,那下载链接就显示不出来了。

operachina论坛的somh专门写了个js,来修复这个问题。
感觉crsky的广告直接用url killfile.txt过滤的效果也可以让人接受,现在多数过滤我都是用这个的
我之前也是,但后来发现很多空白。现在过滤成对标签了。这样页面没那么空白。

url killfile.txt就是adlist这个变量对应的blockfile。不是sidki规则包里的。

我最近也开始研究sidki规则包。感觉还可以。但好多不认识的变量。呵呵

$TYPE(htm)和$TYPE(js)里加个 */ggao/*

增加$NEST(<iframe*,</iframe>)和$NEST(<script*,</script>)。

[Patterns]
Name = "crsky.ggao"
Active = TRUE
URL = "$TYPE(htm)|$TYPE(js)"
Bounds = "$NEST(<iframe*,</iframe>)|$NEST(<script*,</script>)"
Limit = 30000
Match = "<$LST(iframe|script)"
Replace = "<!-- Kill Ad -->\0"

但貌似上面这个pattems效率不好。limit太大了。
imsheng
Posts: 37
Joined: Jan 08 2008, 13:24

Re: 请教ggao 的写法

Post by imsheng »

这个论坛的贴子允许编辑的时间太短了吧。

上面那个pattems只是凭想象写出来的。应该不能用。但我的思路是那样。

还有其他的好办法来擦除成对的标签吗?我嫌那个byte limit太大了。
Ray4
Moderator
Posts: 65
Joined: Dec 30 2007, 09:37

Re: 请教ggao 的写法

Post by Ray4 »

sixsheeps wrote:url killfile.txt???

能否说的明确一些呢。是不是sidki规则里面的呢?我是把全部规则重写的。对于一些通用的广告过滤,能不能分享一下成果?呵呵……也好,研究学习
这个是proxomitron本来就带的,是[HTTP headers]规则里的,名字应该是这个 "URL-Killer: kill-a-URL (out)"(不记得自己动过它的名字没, :lol:

其实规则很简单,就是kill某个链接,很多广告实际就是一个iframe,或者一个独立的js文件,找到它们 的地址,直接放到此文件中即可,看点就是LS所说的,往往会造成空白,如果觉得碍眼,可用内容过滤。

规则不是不拿出来分享,只是自己根本不懂html、js,过滤全凭感觉和工具,怕给出来害了人 :oops:
下面是我的,很多是刚开始用proxomitron时写的,肯定会有错的,只是自己没发现而已,请谨慎使用

Code: Select all

#
# Proxomitron Killfile
#
# The URL killer header filter, if enabled, will kill 
# any URLs matched in this list.  This will completely
# block access to the given site so be cautious.
#
# Note: the AdKey list also calls this list to give the
# banner filters a chance to block these items before
# they reach your browser.
#
# List Indexing:  New in Naoko-4 BetaFour is the ability
# to index some regular expressions. This can greatly speed
# lookup in large lists. No changes to existing lists are necessary
# but to fully take advantage of these features it helps to know
# what can and can't be indexed. This includes...
#
# 1) lines that begin without any wildcards can be index up to the
#    first wildcard.  As in "www.somesite.com/\w/ads.html"
#
# 2) URLs with specific leading wildcards and no further wildcards
#    for the rest of the hostname up to and including the first "/" 
#    For example...
#
#    *.somesite.com/
#    \w.somesite.com/
#    [^.]+.somesite.com/
#    [^/]++somesite.com/
#    ([^/]++.|)somesite.com/    
#    
#    Valid leading wildcards include *, \w, [...]+, [...]++, and (...|)
#    Note that you must include the "/" before any further wildcards. 
#    For instance, "[^.]+.somesite.(com|net)/" won't be indexable
#    because the host includes both leading and trailing wildcards.
#
#
#

# The WWW's
www.clickXchange.com/
www.custom-click.com/
www.eads.com/
www.hotlinks.com/images/ban
www.maximumcash.com/
www.net3media.com/
www.PayPopup.com/
www.popupad.net/
www.sitetracker.com/
www.TheCounter.com/
www.whatsfind.com/

# Matches that disregard 1st sub-domain
[^.]+.adsag.com/
[^.]+.adserver.com/
[^.]+.advertising.com/
[^.]+.atdmt.com/
[^.]+.atwola.com/
[^.]+.counted.com/
[^.]+.hotrank.com/
[^.]+.msads.net/
[^.]+.mtree.com/
[^.]+.nedstatbasic.net/
[^.]+.sitemeter.com/

# Specific ad server hostnames (before the first "/")
[^/]++.adbureau.net/
[^/]++.adquest3d.com/
[^/]++.adserver.com/
[^/]++.advert.com/
[^/]++.burstnet.com/
ad.(([a-z]+{1,4}.|)doubleclick.net/|ad8ad.com/)
[^/]++.fastclick.com/
[^/]++.fastclick.net/
[^/]++.flycast.com/
[^/]++.gator.com/
[^/]++.hitbox.com/
[^/]++.imrworldwide.com/
[^/]++.linkexchange.com/ 
[^/]++.mediaplex.com/
[^/]++.orbitcycle.com/
[^/]++.usads.futurenet.com/
[^/]++.valueclick.com/
[^/]++.valueclick.net/
[^/]++extreme-dm.com/
[^/]++.all4ad.cn/(mtunion/display.aspx|vostro/)


mediamgr.ugo.com/
us.a1.yimg.com/
rd.yahoo.com/M=[0-9.]+{5,*}/
ar.atwola.com/

# Match leading hostname components (ads.something  adserver.something ...)
(us.|cn.|)(
	ad(s(erv(e(r|rv|)|ing)|force|(s|)how|tegrity|c|)|click|client|v)([#0:200]|)|
	leader|
	popme|
	sponsor|
	gg|
	counter).

# Specific pathname components (after the first "/")
$TYPE(htm)$TYPE(js)$TYPE(vbs)[^/]+\w[/?.](ad(s|erver|verts|log.pl)[0-9]+/|banner(s|)/|GetAd|cookie.crumb|js_source/|hitometer.js|index.cgi\?id=???)

# User added URLs
# ----------------- google ------------------------------
pagead2.googlesyndication.com/(pagead/show_ads.js|pagead/iclk|(pagead/|cpa/)ads\?client\=)
www.google.com/afsonline/show_afs_(ads|search).js
www.google-analytics.com/(urchin|ga|siteopt).js
mail.google.com(:[0-9]+{1,6}|)/mail/*view=ad\&
groups.google.co(m|.)([a-z]+|)/groups/adfetch\?adid\=
partner.googleadservices.com/gampad/(google_|)ads
www.google.com/coop/cse/brand\?form\=
#! Google Reader
lh[0-9].google.com/scavina/*.gif

# ----------------- baidu ------------------------------
cpro.baidu.com/cpro/ui/(uijs.php|(ui|cp|iknow).js)
eiv.baidu.com/(other|(ma|hy)img|mapm2)/*.(jpg|gif|swf|js)
unstat.baidu.com/bdun.bsc\?tn
spcode.baidu.com/spcode/(spstyle/style[0-9.]++jsp\?tn\=|baiduunion/inside_tv.js(^?))
box.zhangmen.baidu.com/js/bd_Adv.js
utk.baidu.com/usv/uc.sv\?
baidutv.baidu.com/iknow.html\?pt\=


# ----------------- alibaba -------------------------------
((a|p).alimama.(com|cn)|(((cm|tmatch).p4p|stat.aliunion).cn.yahoo.com))/(inf([1-9]|).js|tbuad\?pid\=|cm\?(u|r)\=)
z.alimama.com/cps.php\?
p.alimama.com/code.php\?
style.china.alibaba.com/js/news/blog_r_show.js
img.alimama.cn/(a|cpa/xmas)/*.(gif|jpg)
cm.yahoo.com.cn/inf.js(^?)
themis.geocities.yahoo.com/themis/h.php\?curl\=
#! Yahoo
[a-z0-9]+.adserver.yahoo.com/
p.mm.cn.yahoo.com/alimamal.php\?
cn.yimg.com/((a)/(ad|alen/js)/|search/kl/)*.(gif|js|swf)(^?)
cn.yimg.com/i/js/ymw/b02/ymw-window-min.js(^?)
us.i1.yimg.com/us.yimg.com/i/mc/mc.js(^?)
us.js2.yimg.com/us.js.yimg.com/lib/smb/js/hosting/cp/js_source/*.js(^?)
view.aliunion.cn.yahoo.com/(show|sel)ad.php

#!------------------- MyDrivers ---------------------------
images.mydrivers.com/20[0-9]+{2}[#1:12]/[^/]++.(swf|gif)
www.shangdu.com/sdnews.shtml
[#1:400].mydrivers.com/hardware/[#1:4000].js

#! ------------------ Enet -----------------------
www.enet.com.cn/home/images/v[#1:15]/[a-z1-9]+.swf
www.enet.com.cn/enetshow.shtml\?

#! ------------------ skycn -----------------------
(www([1-3]|)|tele|ctc).skycn.com/js/(((inside|sort)_head_|(def_|))[#1:30].js|(index|sort)_mid_(n|ex|jqrd|dhlb|)[#1:30].js|bd(un|cpro[1-9]).js|ad_soft_(right|left).js|skycn-count.js|index_head_(fun|)[#1:30].js|soft_*.js(^?))
(www([1-3]|)|tele|ctc|news).skycn.com/(js-swf/index.html|tuijianimg/*.(gif|jpg|swf)|skycn-count.js)
news.skycn.com/js/article_head_[#1:30].js
tk.baidu.com/tk-rcv/sv/cj.php\?pe\=
ad[0-9]+.adpolestar.net/ADPolestar/


# ----------------- PcPop ------------------------------
[^/]++.pcpop.com/statistic/viewcodead.aspx\?AdId\=

#!----------------- CuiWeJu -----------------------------
read.cuiweiju.com/configs/article/pagefoot.js

#! CnBeta
monitor.yigao.com/servlet/beforeAdShow
www.cnbeta.com/images/((576_45z|590_45).gif|ken1ken576x45.swf|ad[#1:300].jpg)
www.cnbeta.com/counter.php\?sid\=
monitor.yigao.com/info.js
online[0-9].lianmeng.com/cn/[0-9]+{2}/[0-9]+{5}_[0-9]+{6}.php
www.cnbeta.com/counter.php\?sid\=[0-9]+{4,6}

(media.yesky.com/(images|adjs)|image.yesky.com/Tlimages)/*.(jpg|gif|swf|js)
#media.yesky.com/adjs/*.(htm(l|)|js)
vd.yesky.com/html/*.js

#! ---------------tianya ---------------------
image.tianya.cn/guide/FooterGuideJs/tianyaFooterJs(100percent|).js
www.tianya.cn/new/js/keyword_replace.js
advertisement.tianya.cn/[#1900:2100]/[#1:12]/*.(swf|jpg|gif|htm(l|))
[0-9]+{1,4}.tianya.cn/(res/|)[#1900:2020]/[#1:12]/*.swf
www.tianya.cn/new/images/vichy_adv/new/[a-z0-9]+.swf

#! Sina
(ad4|d1).sina.com.cn/
pfp.sina.com.cn/*.(html|js)
you.video.sina.com.cn/v/iframe/*.(js|html)
i3.sinaimg.cn/unipro/pub/*.js
woocall.sina.com.cn/rls/(int/|)stable.js
js.icast.cn/a/[0-9/]++.js
rm.sina.com.cn/(icast/rotator|bj-icast/mv/*/icast).js
ent.sina.com.cn/iframe/musicjs/importplayer.0.2.js
image2.sina.com.cn/book/forum/s2/js/minimessage.js
iask.com/innews/ad\?
$OHDR(Referer: http://www.sina.com.cn/(^?))finance.sina.com.cn/iframe/js/[a-z_0-9]+.js(^?)
#! SinaBBS
bbs.sina.com.cn/mini/focus.html

#! Sina blog floating htm
blog.sina.com.cn/lm/html/[0-9]+{1,4}-[0-9]+{1,2}-[0-9]+{1,2}/*.js

# ------------------ XinHua ----------------------

#! crsky
un.265.com/index.htm\?id

#! ------------ MACD --------------
bbs.macd.cn/xc/[a-z0-9\-]+.(swf|gif|jpg)

#! ------------ Duba --------------
u.www.duba.net/union/

#! ------------ Tencent QQ ----------------
adsfile.qq.com/*.(swf|jpg|gif)
ads(text|)view([0-9]|).qq.com/adsview\?c\=
tiapview.qq.com/tad.js\?tid\=

#! KeyRun
u.keyrun.com/(voo.php|js/*.js)
code[#1:200].keyrun.com/

#! GZ
qa.colorful.cn/Show/(color|Script).aspx\?Guid\=
we.pcinlife.com/banner/company/*.swf

#! 163
(adimg|proimg).163.com/channel[0-9]+/*.(swf|gif|html|jpg)
mimg.163.com/external/[^/.]+.js(^?)
impservice.yodao.com/imp/
images.163.com/js/*.html(^?)
www[0-9]+.clickeye.cn/(js.js\?address|common/clickeye.js(^?))
#control.clickeye.com.cn/core.jsp\?url\=
cpc.163.com/Bidding/servlet/BidAdsCode\?
(adclient|pro).163.com/(html|js).ng/site\=
www[0-9]+{1,3}.clickeye.cn/*.js
images.163.com/js/ebay_cash*.html

#! Pconline
image.pconline.com.cn/images/*.gif
img([1-3]|).pconline.com.cn/(pconline/[0-9]+{4,5}/[0-9]+{1,2}/*.swf|ivy/image/*.(swf|gif|jpg))
image.pconline.com.cn/(ivy/image/*.(swf|gif|jpg)|gz/)
*/adpuba/show
www([1-3]|).pconline.com.cn/pconlinebbs/bbs/hzh/(diy|nb|sm|rj|sj)
www([1-3]|).pconline.com.cn/(*/|)js/(ad_*|ivy).js
www([1-3]|).pconline.com.cn/(product/template/|market/price/js/[a-z]+{1,4}_)menu_fubiao([0-9]|).js

#! IT168
software.it168.com/publishbottom/publishbottom.htm(^?)
www?.it168.com/jtzt/qianlan/software/software.html(^?)

# CSDN
z.csdn.net/(gm|ad)js.php\?
www.csdn.net/ggmm/

#! Sohu
adsence.sogou.com/
images.sohu.com/cs/
www.sohu.com/sohuflash_[0-9]+.js
sogou.com/cpc_ads\?pid\=
news.sohu.com/upload/cs/cswb001/news/main.htm
#! sogou MP3
mp3.sogou.com/righter.so
mp3.sogou.com/frag/music/pages/music_?.jsp
ting.mbox.sogou.com/frag/music/pages/(listen_*.jsp(^?)|down_?.jsp\?adInfo\=)


#! Newhua
*/newhuagg/(^news_)*.(js|swf|gif|jpg)
news.newhua.com/Newhua_Files/Newhua_AD/*.(swf|gif)
www.adsky.net/code/show.(js|php\?ad_client\=*)

#! Pchome
btn.pchome.net/
btm.pchome.net/index.php\?btmList\=DOWNLOAD_HEAD

#! vista123
www.vista123.com/script/3.js

#! ZOL
pic.zol.com.cn/*.(swf|gif)
www.zol.com.cn/(home_ad/endad|js/swfobject).js
stat.zol.com.cn/ol.(js|php\?ad_key_list)


#! VeryCD
(sponsor.verycd.com/bccv|222.37.134.125|(c|)img.awocity.cn)/verycd/

# shooter
shooter.cn/jscript/siteadv.js


#! Flashget
ufile.kuaiche.com/(Flashget_union(_bd|).php\?fg_uid|form.php\?sid)\=
ufile.kuaiche.com/search/product[0-9]+.php?sid\=
#! donews
[^/]++.donews.com/(js|AdvScripts)/adv[^/]++.js

#! Mop
gg.mop.com/imgs/*.(jpg|gif)
mopgg.mop.com/adshow.jsp

## ---------- allyes ----------------
#\w.allyes.cn/main/adfshow\?user\=

www.ayxz.com/images/*.js

#----------- IMDB ------------------
i.imdb.com/3pads/
content.pulse360.com/[a-z0-9-]+

# tiexue
ad.tiexue.net/

#! ads8
u.ads8.com/showpage.php\?pid\=
cpa.ads8.com/wenjie/\?
code[0-6]+.ads8.com/pds_k/(js/*.jsp|view.php)

#! ---------------- 02345 --------------------
www.02345.com/js/(gad|sad2|sou3).htm
new.ete.cn/?show.php\?id\=[0-9]+
new.ete.cn/code_[^.]+.php\?adplace_id\=

#! -------------- booksky --------------------
code.comuce.com/bdcode/[0-9]+.html\?userid\=

#! Taobao
www.taobao.com/home/js/sys/p4p/yp4p.js\?t\=[0-9]+.js
www.taobao.com/home/js/chl/frontpage.js\?t\=[0-9]+.js
assets.taobaocdn.com/js/sys/p4p/express_behavior.js\?
auction1.taobao.com/auction/auto/game_auto.htm\?at_iframe\=1
assets.taobaocdn.com/js/(sys/p4p/express_hesper.js\?t\=|event/pdt_2008_hesper.js)
tmatch.p4p.cn.yahoo.com/tbjad\?
# popup
www.taobao.com/home/js/(ainfo.js\?r\=[0-9]+|sys/(ad|default)backpop.js)

#! youku
html.lstat.youku.com/log.js\?ab\=

#---------------------- Counter ------------------------------
js.users.51.la/[0-9]+.js
img.users.51.la/[0-9]+.asp
stat.aliunion.cn.yahoo.com/stat.js\?(pub|w)id\=
count.yisou.yahoo.com/search/web\?s\=
js.tongji.(yahoo.com.cn|cn.yahoo.com)/*/ystat.js
count[0-9]+.51yes.com/click.aspx\?id\=
(v|s|w)[0-9]+.cnzz.com/(stat|c).php?id\=
[a-z0-9]++.allyes.c[a-z.]+/((main/|)(smcreative/(newff/flashpop*|[^/]++).js|(adf(track|show|click)|s)\?(user|db))|*.js)
analytics.163.com/ntes.js
*.sitemeter.com/js/counter.js\?site\=
www.statcounter.com/counter/counter.js
ww(w|).50bang.com/click.js\?user_id\=
clkstat.qihoo.com/qhstat.js
unstat.baidu.com/bdun.bsc\?tn\=
u.juhe.com/count.php\?uid\=
vip1.t2t2.com/visit.js
goto.www.iciba.com/kds2/kds2_record.js
xslt.alexa.com/site_stats/js/t/a\?url\=
#! Thunder
(p4|)pstatic.xunlei.com/(js/(webThunderDetect|base64)|p4p_keyword/*).(js|gif|jpg)
areainfo.xunlei.com:81/zoneinfo.js*
float.sandai.net/finalfiles/*.swf
biz5.sandai.net/portal/(*.(swf|gif|jpg|js)(^?)|search/redirect.htm)
recommend.xunlei.com/((client_mini_new|all).html|/js/clientMiniNew.js)
t3.wrating.com/a1.js
www.clicki.cn/refer/


#! --------- gougou ------------
un.so.gougou.com/js/SearchView.js

#! hanzify
www.hanzify.org/Main/Template/(Js|images)/a-d-s/

# Namipan
www.namipan.com/inc_
# fs2you
www.fs2you.com/media/3rd/
#! other
*/ad(v|s|)(union|img|file|js|publish|-common|)/*.(js|swf|gif|jpg|htm|html)
(v|s)[0-9]+.cnzz.com/stat.php\?id\=
www.7liao.info/top.js
monitor.yigao.com/servlet/beforeAdShow\?publisherId
bbs.pc3w.com/inc/Dv_Adv.js
(pics|play).unionsky.cn/(flash/|(soft|)show/\?placeID\=[1-9]+)
www.xvlink.com/sc/[0-9]+.htm\?user\=
creative.unionsys.bolaa.com/
vip\-qq\-entertainment\-\-\-\-\-idoican\-(b|e).comnetcnmobiorggovcninfo.cn/Code/
down.chinaz.com/other/js/A_[#1:100].js
www?.netsun.com/adu/\?uid\=
(www([1-4|)|cnc|ctc).9v.cn/*.gif
www.creative-weblogging.com/openads/www/
c.advadmin.com/k.html\?id\=
www([1-9]|).sz1001.net/(www/[^/]++.htm|Down_Style/[a-z0-9_]+.js)
www.orsoon.com/orsoon/[^/]++.js
online?.lianmeng.com/[a-z]+{1,3}/[#1:12]/[0-9\_]+.php
code[#1:200].1133.cc/pds_k/
display.xpresssyndication.com/xpress\?pubid\=
code.vogate.com/(java|)script/
[a-z]+[0-9]+.acode.ifocus.cn/
cpa.onecpc.cn/cpa/

pic.adver.com.tw/js/[a-z]+.js
y.cnxad.com/(AdCode.aspx|code.js)
s1.17luntan.com//JSResource/
www.rsdown.cn/readme/
www.7mlink.com/getcode
code.linklink8.com/code/adview_pic.php
code.is686.com/code/
init.phpwind.com/init.php?sitehash
home.wangmeng.com/ai.aspx
u2.kubao.com/lib/
code.5k3g.com/[^/]+/[0-9]+{1,4}x[0-9]+{1,4}(_[#1:300]|).htm\?uid
cpv.channelray.com/preroll/player/SuiShi/js/SuiShiInsiderOnline.js
www(1|2|).piaodown.com/down/vvv/
server.yoyi.com.cn/afp/get/\;pl_id\=
ec2.3527.com/popn_[0-9_]+.js
www.idc-1.com/js/js.php\?t\=
a[#1:300].insenz.com/adv\?sid\=
www.duote.com/duotehtml/*.(js|html)(^?)
www.52z.com/show/*.(html|jpg)(^?)
u.x-push.net/tl/[#1:1680]x[#1:1050][a-z]+/*.html(^?)
u.114.com.cn/show/us.php\?userid\=
cj.8le8le.com/cj/
y.abang.com/z/bd/
www.openv.com/spread/
file.chanet.com.cn/image.cgi\?a\=

u.heima8.com/pv/js/uusee.js
ppntv.033.com/clientPage/js.jsp

#! wwenglish
www.wwenglish.com/inc/com/*.gif
*du(i|)lian.js
*((dhtml|)popoup)(_|)(ad(v|s|)(float|content(integrated|)).js


#! -------------- software ------------
www.ieinspector.com/autoupdate/HttpAnalyzerStdV2.inf


########### Video Sites ##############
#! -------------- Tudou --------------
img01.p2v.tudou.com/(poster|mk2)/*.(jpg|gif)
#ad((extension|)control|play).tudou.com/
r.6.cn/*.swf
my.51job.com/my/trace.php\?adsnum\=
h1.untang.com/display.vip
adtology.com/


########## eBook Sites ##########
www.qidian.com/BookReader/AdOver.aspx\?img\=



#!------------- youku --------------
html.atm.youku.com/html\?p\=
so.youku.com/pv/baiduTopic

#! -------------- qihoo --------------
code.qihoo.com/ad_bcast/html_show.js\?a\=

#! ------------- feed AD --------------
feed.feedsky.com/~cpm/*/s.gif
www.pheedo.com/img.phdo\?s\=
feeds.feedburner.com/~a/*\?i\=
\w.img.pheedo.com/img.phdo\?x\=


#! ------ Virus ---------
*.llsging.com/


#! ------------ Foreign ----------------
(bdv|www).bidvertiser.com/BidVertiser.dbm
www.ttzmedia.com/affiliate/*_ad.js
adserving.cpxinteractive.com/st\?ad_type
(a.tribalfusion.com/j.ad\?site\=|ctxtad.tribalfusion.com/ctxtad/Keywords\?ads\=*link\=|ctxt.tribalfusion.com/ctxt/textlinks.js)
(mm|scripts).chitika.net/((eminimalls/mm|static/whitespacead/whitespacead).js|minimall\?*client\=)
cache.blogads.com/[0-9]+/feed.(js|css)
pages.etology.com/js2/[0-9]+.php
www.text-link-ads.com/
popunder.adsrevenue.net/popup.php\?
[^/]++.advertlets.com/
servedby.infinite-ads.com/add/script.js\?
bwp.download.com/search\?dw-siteid\=
cdn1.eyewonder.com/[0-9/]++/wrapper.js
ad.yieldmanager.com/

adtology.com/[^/]++/[#1:2000]x[0-9]+{1,4}a.htm

[^/]++.adbrite.com/mb/text_group.php\?sid\=

anonymouse.org/cgi-bin/anon-www.cgi/http://de.adserver.yahoo.com/a\?f\=

#! ----------- Pc World ----------------
jlinks.industrybrains.com/jsinsert\?sid\=

# ------------ Imageshack ----------------
www.finestresults.com/.jt\?uid\=

# -------------- sourceforge ---------------------
jobs.sourceforge.net/feeds/jobroll/\?

#! ----------- amazon -------------------
cls.assoc-amazon.com/[a-z]+{1,4}/links/updateLinks-2.js
cls.assoc-amazon.com/s/cls.js
www.assoc-amazon.com/s/ads(-common|).js


te.kontera.com/ContentLink/ContentLink\?publisherId\=
kona.kontera.com/javascript/lib/*.js

#!--------------- keywords ---------------------
*/(
	(
		(rss|center)ad(v|frame|)((list(new|)|)|)([#1:200]|)|
		list_adv|
		ad_[^/.]+|
		(guan|index|)gg(ao|ad|)([0-9]+|)).htm(l|)(^?)|
	(
		(qv_|)adv_content|
		popwindowjs|
		18ads|
		event/event/editorcommend_[0-9]+{6}).php(^?)|
	robot(s|).txt(^?)|
	(ad|all|nn|db|promo(tion|))([-_]|)banner|
	(ShowADContent|(ad|union)js|[a-z0-9]+_adshow|bbsAD).asp\?|
	float.jsp|
	AdCode.aspx|
	ad(\?|/|s|v|(_|)(image|s(e|)rv|box)|cycle|rotate|mentor|click|f[ra]m|script|stream|fetch|log|space)|
	(
			adframe.php|
			ad(s|v|f|)(_|)(show|view|js.php)(_pic.php|)|
			ggjs.php|
			count(er|).php|
			(get|iframe)AD.php|
			sysbanner.php)\?|
	[a-z]+\?ad_type\=|
	(
		topbanner([1-9]+|)|
		g(uang|)g(ao|)|
		ad(_(fly_|)|v|)[#1:100]|
		include/javascript/msn|
		(float|site)adv|
		ad(v_display|(v|vjs|js)ContentIntegrated)|
		js/(showgm|base64|ad(v(|er)|s)_(left|right)|ad(word(s|)|))|
		ad(s|)_*|
		([^/]++_|show|link)ads|
		WaitAd(v|s|js|)|
		ad(s|)flash|
		114/114|
		popup_win|
		Flashget_base64|
		flash_noline|
		script/pop|
		webThunderDetect|
		float(tb|)).js(^?))


# Directory
*/(
	(guan|index|js|)gg(ao|ad|)(js|img|image(s|)|)|
	(4|gg_|)ad((_|)(file|pic(ture|)|view)(s|)|s|v(er(tis(e|ing)|t|)|)|(v|_|)js|pop|portal|polestar|show|_src|)(_((publish|logo)|html|)|)([#1:100]|)|
	pop(up|me|ads(new|))|
	ads(union|888|)|
	union|
	Ad(Portal|process)|
	phpAdsNew|
	(_|)sponsor(s|ed(review(s|)|)|))/

*/(
	banner([0-9]+|)|
	ad-[a-z1-9-]++|
	mini_ad([#1:30]|)|
	ad(v|s|_)([#1:200]|)|
	guanggao|
	webad/images/[^/]++|
	[a-z0-9\-_]++banner).(jpg|gif|swf)
# Union
union([#2]|).(
	mop.com/ushow.jsp|
	qyule.com|
	filmyeah.com/banner/banner.html|
	cmfu.com|
	163.com/|
	50bang.org/js/|
	.narrowad.com/proxy/js/(paras|(n|bhv)ad).js(^?)|
	114.com.cn/search/|
	51traffic.com/union/union.js\?traffic_usr_id\=|
	narrowad.com/|
	baidu.com/|
	co-cm.com/|
	pomoho.com)
adunion.pconline.com.cn/show\?id\=
unionch.qyule.com/cgi-bin/code.cgi

# Domain
#ads.(
	telwords.com/show.php|
	telcenter.cn/show.php|
	.mininova.org/adjs.php|
	manyway.net)




# Virus
jump.adminbbs.info/(index|up|down).js

############## Bypass #######################
~[^/]+/*(ad(sl|v(i[cs]|o|an|ertencia|ent|*search|erse)|sense-) |(lo|thre|he|d|gr|l|ro|re|squ|class(ified|))ads|account)
sixsheeps
Posts: 30
Joined: Jan 17 2008, 14:59

Re: 请教ggao 的写法

Post by sixsheeps »

看了一遍,发现写的很全。用这个过滤一般人都够用了!呵呵。厉害啊。之前一直在研究web page的单个网站的过滤,现在再加上header的通用过滤,就够用了。
有一个问题:这个header过滤是不是每次浏览网页都把那么多内容全部匹配一遍呢?
Ray4
Moderator
Posts: 65
Joined: Dec 30 2007, 09:37

Re: 请教ggao 的写法

Post by Ray4 »

sixsheeps wrote:看了一遍,发现写的很全。用这个过滤一般人都够用了!呵呵。厉害啊。之前一直在研究web page的单个网站的过滤,现在再加上header的通用过滤,就够用了。
有一个问题:这个header过滤是不是每次浏览网页都把那么多内容全部匹配一遍呢?
个人感觉,直接用header过滤应该比内容过滤要好一些,我用firefox浏览的时候,基本不会留下什么空白,但用ie、TW、GB时就会发现很多空白(不知道opera怎么样),所以用header加上一部分内容过滤足以,不用再为空白部分添加 新的规则了(如果用ie,嘿嘿, 8-) )。

至于是否全部匹配,我想应该是的
Post Reply