我怎样才能在PHP中获取a p标签的alt属性?
我正在编写一个只回应价格的脚本.
如果我做: $alttag = $oNode['p']; echo $alttag; 它将回显< p>< / p>中的所有内容. cafénevrij的roodmerk 2 pakken prijs每公斤1,99 199 从网站上,所以你可以看到它的回声199,这是价格,但首先我只需要在< p>< / p>中的199而且我要 .或者,在199之间,因此它将显示1,99或1.99. 如果我做: $alttag = $oNode['p sup']; echo $alttag; 它只会回复< sup>< / sup>中的99. $alttag = $oNode['p sup']; $maintag = $oNode['p']->attr('alt'); echo $maintag . $alttag; 嗯……这没什么 <div class="item-prijs"> <p> <cufon class="cufon cufon-canvas" alt="1" style="width: 27px; height: 42px; "> <canvas width="47" height="43" style="width: 47px; height: 43px; top: -1px; left: -2px; "></canvas> <cufontext>1</cufontext> </cufon> <sup> <cufon class="cufon cufon-canvas" alt="99" style="width: 24px; height: 20px; "> <canvas width="35" height="21" style="width: 35px; height: 21px; top: -1px; left: -1px; "> </canvas><cufontext>99</cufontext> </cufon> </sup> </p> </div> 完整代码:不包含php函数和数据库连接. // Extracts offers from html and return in array function extractSparOffers($url) { loadPqUrl($url); //Test $dates = extractDateRange(pq('.contentdatagrid td:first')); $oNodes = pq('.item'); if($oNodes->count() == 0) throw new Exception('No offers were found.'); foreach($oNodes as $oNode) { $oNode = pq($oNode); //Test $titleDescCell = $oNode['input#a']->parent(); //Test $titleDescCell['img,input']->remove(); $priceCell = $oNode['span.price1']->parent()->parent(); // Get title and description $data['title'] = $oNode['.item-content h3']; $data['description'] = $oNode['.item-content p']; // Get prices (page may contain price ranges) $alttag = $oNode['p sup']; $maintag = $oNode['p']->attr('alt'); echo $maintag; //echo $alttag; //$alttags=preg_match_all('/<img[^>]*alt="([^"]*)"/i',$html,$matches); $none = "0.00"; $data['priceBefore'] = $none; $data['priceAfter'] = $alttag; // $oNode['item-prijs p.sup.cufon cufon-canvas']->attr('alt') ; // Get image $imgNode = $oNode['img:only-child']; if(count($imgNode) > 0) $img = getimg('http://www.spar.nl/' . $oNode['img:only-child']-> attr('src')); else $img = ''; $data['image'] = $img; //Test $data['dateStart'] = $dates['start']; //Test $data['dateEnd'] = $dates['end']; $date =date('Y-m-d'); $data['dateStart'] = date('Y-m-d',strtotime("yesterday")); $data['dateEnd'] = date('Y-m-d',strtotime("tomorrow")); $data = formatOfferStrings($data); $odTotal[] = $data; } return $odTotal; } spiderInit(); $offerData = extractSparOffers('http://www.spar.nl/aanbiedingen/'); //Test processNewOffers('Spar',$offerData,$offerData[0]['dateStart']); processNewOffers('Spar',$dates['start']); ?> 解决方法
那么这基本上是价格的网络爬虫吗?我建议你研究一下使用PHP的DOMDocument库解析XML(实际上是哪个XHTML).然后你可以这样做:
//create a new DOMDocument object $xmlDoc = new DOMDocument(); //load your html for parsing $xmlDoc->loadHTML("<html><body>Your HTML Code<br></body></html>"); //select the element that you want the attribute from...you may need to use $xmlDoc->getElementsByTagName('p'); $p_element = $xmlDoc->getElementById('yourtag'); //get the attribute alt of the selected element $alt = $p_element->getAttribute('alt'); //show alt attribute value echo $alt; 这只是伪代码,并不能解决您的问题,但它似乎是一个比您尝试使用的解析器更好的解决方案.查看这些链接以获取更多信息(我希望这会有所帮助): http://www.php.net/manual/en/domdocument.construct.php http://php.net/manual/en/domelement.getattribute.php http://www.php.net/manual/en/domdocument.getelementsbytagname.php http://www.php.net/manual/en/domdocument.getelementbyid.php (编辑:李大同) 【声明】本站内容均来自网络,其相关言论仅代表作者个人观点,不代表本站立场。若无意侵犯到您的权利,请及时与联系站长删除相关内容! |