phpを使用してh1タグからすべての値を取得する

Question

テキストからすべてのh1タグ値を含む配列を受け取りたい

たとえば、これが指定された入力文字列の場合：

<h1>hello</h1> <p>random text</p> <h1>title number two!</h1>

これを含む配列を受け取る必要があります：

titles[0] = 'hello', titles[1] = 'title number two!'

文字列の最初のh1値を取得する方法はすでに理解しましたが、指定された文字列内のすべてのh1タグのすべての値が必要です。

私は現在、これを使用して最初のタグを受け取ります。

function getTextBetweenTags($string, $tagname) { $pattern = "/<$tagname ?.*>(.*)<\/$tagname>/"; preg_match($pattern, $string, $matches); return $matches[1]; }

解析したい文字列を渡し、$ tagnameとして「h1」を入力します。自分で書いたわけではありませんが、コードを編集してやりたいことを実行しようとしていますが、実際には何も機能しません。

私は誰かが私を助けてくれることを望んでいました。

前もって感謝します。

Sergey Eremin · Accepted Answer

simplehtmldom を使用できます：

function getTextBetweenTags($string, $tagname) { // Create DOM from string $html = str_get_html($string); $titles = array(); // Find all tags foreach($html->find($tagname) as $element) { $titles[] = $element->plaintext; } }

Wrikken · Answer

function getTextBetweenTags($string, $tagname){ $d = new DOMDocument(); $d->loadHTML($string); $return = array(); foreach($d->getElementsByTagName($tagname) as $item){ $return[] = $item->textContent; } return $return; }

Gordon · Answer

DOMの代替。メモリが問題になる場合に使用します。

$html = <<< HTML <html> <h1>hello<span>world</span></h1> <p>random text</p> <h1>title number two!</h1> </html> HTML; $reader = new XMLReader; $reader->xml($html); while($reader->read() !== FALSE) { if($reader->name === 'h1' && $reader->nodeType === XMLReader::ELEMENT) { echo $reader->readString(); } }

Ahmed Aman · Answer

 function getTextBetweenH1($string) { $pattern = "/<h1>(.*?)<\/h1>/"; preg_match_all($pattern, $string, $matches); return ($matches[1]); }