lxml xpathクエリで空の名前空間を使用するにはどうすればよいですか？

Question

次の形式のxmlドキュメントがあります。

<feed xmlns="http://www.w3.org/2005/Atom" xmlns:openSearch="http://a9.com/-/spec/opensearchrss/1.0/" xmlns:gsa="http://schemas.google.com/gsa/2007"> ... <entry> <id>https://ip.ad.dr.ess:8000/feeds/diagnostics/smb://ip.ad.dr.ess/path/to/file</id> <updated>2011-11-07T21:32:39.795Z</updated> <app:edited xmlns:app="http://purl.org/atom/app#">2011-11-07T21:32:39.795Z</app:edited> <link rel="self" type="application/atom+xml" href="https://ip.ad.dr.ess:8000/feeds/diagnostics"/> <link rel="edit" type="application/atom+xml" href="https://ip.ad.dr.ess:8000/feeds/diagnostics"/> <gsa:content name="entryID">smb://ip.ad.dr.ess/path/to/directory</gsa:content> <gsa:content name="numCrawledURLs">7</gsa:content> <gsa:content name="numExcludedURLs">0</gsa:content> <gsa:content name="type">DirectoryContentData</gsa:content> <gsa:content name="numRetrievalErrors">0</gsa:content> </entry> <entry> ... </entry> ... </feed>

Lxmlのxpathを使用してすべてのentry要素を取得する必要があります。私の問題は、空の名前空間の使用方法がわからないことです。次の例を試しましたが、うまくいきません。お知らせ下さい。

import lxml.etree as et tree=et.fromstring(xml)

私が試したさまざまなことは次のとおりです。

for node in tree.xpath('//entry'):

または

namespaces = {None:"http://www.w3.org/2005/Atom" ,"openSearch":"http://a9.com/-/spec/opensearchrss/1.0/" ,"gsa":"http://schemas.google.com/gsa/2007"} for node in tree.xpath('//entry', namespaces=ns):

または

for node in tree.xpath('//\"{http://www.w3.org/2005/Atom}entry\"'):

この時点で、私は何を試すべきかわかりません。どんな助けでも大歓迎です。

mzjn · Accepted Answer

このようなものが機能するはずです：

import lxml.etree as et ns = {"atom": "http://www.w3.org/2005/Atom"} tree = et.fromstring(xml) for node in tree.xpath('//atom:entry', namespaces=ns): print node

http://lxml.de/xpathxslt.html#namespaces-and-prefixes も参照してください。

代替：

for node in tree.xpath("//*[local-name() = 'entry']"): print node

Seb · Answer

findall メソッドを使用します。

for item in tree.findall('{http://www.w3.org/2005/Atom}entry'): print item