Python urllib2プログレスフック

Question

Urllib2 httpクライアントを使用してpythonでダウンロードプログレスバーを作成しようとしています。API（およびGoogle）を調べましたが、urllib2では進行状況を登録できないようです。フック。ただし、古い非推奨のurllibにはこの機能があります。

Urllib2を使用してプログレスバーまたはレポートフックを作成する方法を知っている人はいますか？または、同様の機能を取得するための他のハックはありますか？

Triptych · Accepted Answer

これは、応答をチャンク化するというAnuragのアプローチに基づいた完全に機能する例です。私のバージョンでは、チャンクサイズを設定し、任意のレポート関数をアタッチできます。

import urllib2, sys def chunk_report(bytes_so_far, chunk_size, total_size): percent = float(bytes_so_far) / total_size percent = round(percent*100, 2) sys.stdout.write("Downloaded %d of %d bytes (%0.2f%%)
" % (bytes_so_far, total_size, percent)) if bytes_so_far >= total_size: sys.stdout.write('
') def chunk_read(response, chunk_size=8192, report_hook=None): total_size = response.info().getheader('Content-Length').strip() total_size = int(total_size) bytes_so_far = 0 while 1: chunk = response.read(chunk_size) bytes_so_far += len(chunk) if not chunk: break if report_hook: report_hook(bytes_so_far, chunk_size, total_size) return bytes_so_far if __name__ == '__main__': response = urllib2.urlopen('http://www.ebay.com'); chunk_read(response, report_hook=chunk_report)

Anurag Uniyal · Answer

データをチャンクで読み取り、その間にやりたいことを何でもしてみませんか。スレッドで実行したり、UIにフックしたりするなど

import urllib2 urlfile = urllib2.urlopen("http://www.google.com") data_list = [] chunk = 4096 while 1: data = urlfile.read(chunk) if not data: print "done." break data_list.append(data) print "Read %s bytes"%len(data)

出力：

Read 4096 bytes Read 3113 bytes done.

Ignacio Vazquez-Abrams · Answer

rlgrabber 進行状況通知のサポートが組み込まれています。

user4459011 · Answer

簡略版：

temp_filename = "/tmp/" + file_url.split('/')[-1] f = open(temp_filename, 'wb') remote_file = urllib2.urlopen(file_url) try: total_size = remote_file.info().getheader('Content-Length').strip() header = True except AttributeError: header = False # a response doesn't always include the "Content-Length" header if header: total_size = int(total_size) bytes_so_far = 0 while True: buffer = remote_file.read(8192) if not buffer: sys.stdout.write('
') break bytes_so_far += len(buffer) f.write(buffer) if not header: total_size = bytes_so_far # unknown size percent = float(bytes_so_far) / total_size percent = round(percent*100, 2) sys.stdout.write("Downloaded %d of %d bytes (%0.2f%%)
" % (bytes_so_far, total_size, percent))

Geoff · Answer

実際にファイルを書き出すことができるようにするためのTriptychの応答へのマイナーな変更（python3）：

from urllib.request import urlopen def chunk_report(bytes_so_far, chunk_size, total_size): percent = float(bytes_so_far) / total_size percent = round(percent*100, 2) sys.stdout.write("Downloaded %d of %d bytes (%0.2f%%)
" % (bytes_so_far, total_size, percent)) if bytes_so_far >= total_size: sys.stdout.write('
') def chunk_read(response, chunk_size=8192, report_hook=None): total_size = response.info().get("Content-Length").strip() total_size = int(total_size) bytes_so_far = 0 data = b"" while 1: chunk = response.read(chunk_size) bytes_so_far += len(chunk) if not chunk: break if report_hook: report_hook(bytes_so_far, chunk_size, total_size) data += chunk return data

使用法：

with open(out_path, "wb") as f: response = urlopen(filepath) data_read = chunk_read(response, report_hook=chunk_report) f.write(data_read)