• 欢迎访问搞代码网站,推荐使用最新版火狐浏览器和Chrome浏览器访问本网站!
  • 如果您觉得本站非常有看点,那么赶紧使用Ctrl+D 收藏搞代码吧

PHP屏蔽蜘蛛访问代码及常用搜索引擎的HTTP_USER_AGENT_php技巧

php 搞代码 4年前 (2022-01-26) 21次浏览 已收录 0个评论

PHP屏蔽蜘蛛访问代码代码

常用搜索引擎名与 HTTP_USER_AGENT对应值

百度baiduspider

谷歌googlebot

搜狗sogou

腾讯SOSOsosospider

雅虎slurp

有道youdaobot

Bingbingbot

MSNmsnbot

Alexais_archiver

function is_crawler() {   $userAgent = strtolower($_SERVER['HTTP_USER_AGENT']);   $spiders = array(     'Googlebot', // Google 爬虫     'Baiduspider', // 百度爬虫     'Yahoo! Slurp', // 雅虎爬虫     'YodaoBot', // 有道爬虫     'msnbot' // Bing爬虫     // 更多爬虫关键字   );   foreach ($spiders as $spider) {     $spider = strtolower($spider);     if (strpos($userAgent, $spider) !== false) {       return true;     }   }   return false; }

下面的php代码附带了更多的蜘蛛标识

function isCrawler() {     echo $agent= strtolower($_SERVER['HTTP_USER_AGENT']);     if (!empty($agent)) {         $spiderSite= array(             "TencentTraveler",             "Baiduspider+",             "BaiduGame",             "Googlebot",             "msnbot",             "Sosospider+",             "Sogou web spider",             "ia_archiver",             "Yahoo! Slurp",             "YoudaoBot",             "Yahoo Slurp",             "MSNBot",             "Java (Often spam bot)",             "BaiDuSpider",             "Voila",             "Yandex bot",             "BSpider",             "twiceler",             "Sogou Spider",             "Speedy Spider",             "Google AdSense",             "Heritrix",             "Python-urllib",             "Alexa (IA Archiver)",             "Ask",             "Exabot",             "Custo",             "OutfoxBot/YodaoBot",             "yacy",             "SurveyBot",             "legs",             "lwp-trivial",             "Nutch",             "StackRambler",             "The web archive (IA Archiver)",             "Perl tool",             "MJ12bot",             "Netcraft",             "MSIECrawler",             "WGet tools",             "larbin",             "Fish search",         );         foreach($spiderSite as $val) {             $str = strtolower($val);             if (strpos($agent, $str) !== false) {                 return true;             }         }     } else {         return false;     } } if (isCrawler()){     echo "你好蜘蛛精!"; } else{    echo "你不是蜘蛛精啊!"; }

使用PHP实现蜘蛛访问日志统计

$useragent = addslashes(strtolower($_SERVER['HTTP_USER_AGENT'])); if (strpos($useragent, 'googlebot')!== false){$bot = 'Google';} elseif (strpos($useragent,'mediapartners-google') !== false){$bot = 'Google Adsense';} elseif (strpos($useragent,'baiduspider') !== false){$bot = 'Baidu';} elseif (strpos($useragent,'sogou spider') !== false){$bot = 'Sogou';} elseif (strpos($useragent,'sogou web') !== false){$bot = 'Sogou web';} elseif (strpos($useragent,'sosospider') !== false){$bot = 'SOSO';} elseif (strpos($useragent,'360spider') !== false){$bot = '360Spider';} elseif (strpos($useragent,'yahoo') !== false){$bot = 'Yahoo';} elseif (strpos($useragent,'msn') !== false){$bot = 'MSN';} elseif (strpos($useragent,'msnbot') !== false){$bot = 'msnbot';} elseif (strpos($useragent,'sohu') !== false){$bot = 'Sohu';} elseif (strpos($useragent,'yodaoBot') !== false){$bot = 'Yodao';} elseif <a>本2文来*源gao($daima.com搞@代@#码(网</a><strong>搞gaodaima代码</strong>(strpos($useragent,'twiceler') !== false){$bot = 'Twiceler';} elseif (strpos($useragent,'ia_archiver') !== false){$bot = 'Alexa_';} elseif (strpos($useragent,'iaarchiver') !== false){$bot = 'Alexa';} elseif (strpos($useragent,'slurp') !== false){$bot = '雅虎';} elseif (strpos($useragent,'bot') !== false){$bot = '其它蜘蛛';} if(isset($bot)){   $fp = @fopen('bot.txt','a');   fwrite($fp,date('Y-m-d H:i:s')."\t".$_SERVER["REMOTE_ADDR"]."\t".$bot."\t".'http://'.$_SERVER['SERVER_NAME'].$_SERVER["REQUEST_URI"]."\r\n");   fclose($fp); }

搞代码网(gaodaima.com)提供的所有资源部分来自互联网,如果有侵犯您的版权或其他权益,请说明详细缘由并提供版权或权益证明然后发送到邮箱[email protected],我们会在看到邮件的第一时间内为您处理,或直接联系QQ:872152909。本网站采用BY-NC-SA协议进行授权
转载请注明原文链接:PHP屏蔽蜘蛛访问代码及常用搜索引擎的HTTP_USER_AGENT_php技巧

喜欢 (0)
[搞代码]
分享 (0)
发表我的评论
取消评论

表情 贴图 加粗 删除线 居中 斜体 签到

Hi,您需要填写昵称和邮箱!

  • 昵称 (必填)
  • 邮箱 (必填)
  • 网址