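This article shows how to fetch web pages in PHP with the Snoopy class.
The example below walks through typical Snoopy usage: configuring the client (user agent, referer, cookies, raw headers, redirect limits), fetching the text of a page, and printing the response code, headers, and body.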
<?php
/* You need snoopy.class.php from http://snoopy.sourceforge.net/ */
include("snoopy.class.php");

$snoopy = new Snoopy;

// need a proxy?
//$snoopy->proxy_host = "my.proxy.host";
//$snoopy->proxy_port = "8080";

// set browser and referer:
$snoopy->agent = "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)";
$snoopy->referer = "http://www.jonasjohn.de/";

// set some cookies:
$snoopy->cookies["SessionID"] = '238472834723489';
$snoopy->cookies["favoriteColor"] = "blue";

// set a raw header:
$snoopy->rawheaders["Pragma"] = "no-cache";

// set some internal variables:
$snoopy->maxredirs = 2;
$snoopy->offsiteok = false;
$snoopy->expandlinks = false;

// set username and password (optional)
//$snoopy->user = "joe";
//$snoopy->pass = "bloe";

// fetch the text of the website www.google.com:
if ($snoopy->fetchtext("http://www.google.com")) {
    // other methods: fetch, fetchform, fetchlinks, submittext and submitlinks

    // response code:
    print "response code: ".$snoopy->response_code."<br/>\n";

    // print the headers (foreach is used because each() no longer exists in PHP 8):
    print "<b>Headers:</b><br/>";
    foreach ($snoopy->headers as $key => $val) {
        print $key.": ".$val."<br/>\n";
    }
    print "<br/>\n";

    // print the text of the website:
    print "<pre>".htmlspecialchars($snoopy->results)."</pre>\n";
} else {
    print "Snoopy: error while fetching document: ".$snoopy->error."\n";
}
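The header loop in the example prints each entry of $snoopy->headers with its array key. In the Snoopy versions I am aware of, that array holds raw header lines (for example "Content-Type: text/html") under numeric keys, so the printed keys are just positions. Assuming that layout, a small helper can turn the lines into a name => value map. The function name parseHeaderLines and the sample data are hypothetical and not part of Snoopy; this is a sketch, not the library's own API.

```php
<?php
// Hypothetical helper (not part of Snoopy): convert raw header lines such as
// "Content-Type: text/html\r\n" into a lower-cased name => value map.
function parseHeaderLines(array $lines): array
{
    $parsed = [];
    foreach ($lines as $line) {
        $line = trim($line);
        $pos = strpos($line, ':');
        // Skip blank lines, lines without a colon, and the status line ("HTTP/1.1 200 OK").
        if ($line === '' || $pos === false || stripos($line, 'HTTP/') === 0) {
            continue;
        }
        $name  = strtolower(trim(substr($line, 0, $pos)));
        $value = trim(substr($line, $pos + 1));
        // Repeated headers (e.g. Set-Cookie) are joined with ", ".
        $parsed[$name] = isset($parsed[$name]) ? $parsed[$name] . ', ' . $value : $value;
    }
    return $parsed;
}

// Sample input standing in for $snoopy->headers after a successful fetch (assumed shape).
$sample = [
    "HTTP/1.1 200 OK\r\n",
    "Content-Type: text/html; charset=UTF-8\r\n",
    "Pragma: no-cache\r\n",
];
$headers = parseHeaderLines($sample);
print $headers['content-type'] . "\n";
```

With a real request, the same call would be parseHeaderLines($snoopy->headers) inside the success branch, after which a single header can be looked up by its lower-cased name.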
Summary: that covers the basic use of Snoopy for fetching pages in PHP. Hopefully it is useful for your own work.
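Related reading:
Implementing a circular queue in PHP on top of memcache
Working with images in PHP: resizing, watermarking, generating CAPTCHAs, output and saving
Example of a PHP class for reading configuration files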