Python 3 error: TypeError: Can't convert 'bytes' object to str implicitly

Question

I'm working on exercise 41 of Learn Python the Hard Way and keep getting this error:

    Traceback (most recent call last):
      File ".\url.py", line 72, in <module>
        question, answer = convert(snippet, phrase)
      File ".\url.py", line 50, in convert
        result = result.replace("###", Word, 1)
    TypeError: Can't convert 'bytes' object to str implicitly

The book uses Python 2 but I'm using Python 3, so I made a few changes. Here is the script:

    #!/usr/bin/python
    # Filename: urllib.py

    import random
    from random import shuffle
    from urllib.request import urlopen
    import sys

    Word_URL = "http://learncodethehardway.org/words.txt"
    WORDS = []

    PHRASES = {
        "class ###(###):":
            "Make a class named ### that is-a ###.",
        "class ###(object):\n\tdef __init__(self, ***)":
            "class ### has-a __init__ that takes self and *** parameters.",
        "class ###(object):\n\tdef ***(self, @@@)":
            "class ### has-a funciton named *** that takes self and @@@ parameters.",
        "*** = ###()":
            "Set *** to an instance of class ###.",
        "***.*** = '***'":
            "From *** get the *** attribute and set it to '***'."
    }

    # do they want to drill phrases first
    PHRASE_FIRST = False
    if len(sys.argv) == 2 and sys.argv[1] == "english":
        PHRASE_FIRST = True

    # load up the words from the website
    for Word in urlopen(Word_URL).readlines():
        WORDS.append(Word.strip())

    def convert(snippet, phrase):
        class_names = [w.capitalize() for w in
                       random.sample(WORDS, snippet.count("###"))]
        other_names = random.sample(WORDS, snippet.count("***"))
        results = []
        param_names = []

        for i in range(0, snippet.count("@@@")):
            param_count = random.randint(1,3)
            param_names.append(', '.join(random.sample(WORDS, param_count)))

        for sentence in snippet, phrase:
            result = sentence[:]

            # fake class names
            for Word in class_names:
                result = result.replace("###", Word, 1)

            # fake other names
            for Word in other_names:
                result = result.replace("***", Word, 1)

            # fake parameter lists
            for Word in param_names:
                result = result.replace("@@@", Word, 1)

            results.append(result)

        return results

    # keep going until they hit CTRL-D
    try:
        while True:
            snippets = list(PHRASES.keys())
            random.shuffle(snippets)

            for snippet in snippets:
                phrase = PHRASES[snippet]
                question, answer = convert(snippet, phrase)
                if PHRASE_FIRST:
                    question, answer = answer, question

                print(question)

                input("> ")
                print("ANSWER: {}\n\n".format(answer))
    except EOFError:
        print("\nBye")

What am I doing wrong here? Thanks!

Ashwini Chaudhary · Accepted Answer

urlopen() returns a response object whose contents are read as bytes. To perform string operations on that data, you need to convert it to str first.

    for Word in urlopen(Word_URL).readlines():
        WORDS.append(Word.strip().decode('utf-8'))  # utf-8 works in your case

To determine the correct charset, see: How to download any(!) webpage with correct charset in python?
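As an alternative to decoding each line by hand, the binary stream can be wrapped so that it yields text directly. The following is only a sketch: the helper name `read_words` is made up for illustration, UTF-8 is an assumed encoding, and an `io.BytesIO` object stands in for the response so that the example runs without network access.

```python
import io


def read_words(binary_stream, encoding="utf-8"):
    """Return stripped text lines from a binary file-like object."""
    # TextIOWrapper decodes on the fly, so every line it yields is a str.
    text_stream = io.TextIOWrapper(binary_stream, encoding=encoding)
    return [line.strip() for line in text_stream]


# io.BytesIO is a stand-in for the object returned by urlopen().
fake_response = io.BytesIO(b"apple\nbanana\ncherry\n")
words = read_words(fake_response)
print(words)
```

Because every element of `words` is already a str, a later call such as `result.replace("###", Word, 1)` would receive the type it expects.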

abarnert · Answer

In Python 3, the urlopen function returns an HTTPResponse object, which behaves like a binary file. So, when you do this:

    for Word in urlopen(Word_URL).readlines():
        WORDS.append(Word.strip())

…you end up with a bunch of bytes objects instead of str objects. So when you do this:

    result = result.replace("###", Word, 1)

…you're trying to replace the string "###" within the string result with a bytes object instead of a str. Hence the error:

    TypeError: Can't convert 'bytes' object to str implicitly
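The mismatch can be reproduced without any network access. This is a minimal sketch in which a hard-coded bytes value (`raw_word`, a hypothetical name) stands in for one downloaded line; the exact wording of the TypeError message may differ between Python 3 versions, so the example only checks that the exception is raised.

```python
# A bytes value standing in for one line read from the HTTP response.
raw_word = b"apple\n"

template = "class ###(object):"

try:
    # str.replace() only accepts str arguments, so passing bytes raises TypeError.
    template.replace("###", raw_word.strip(), 1)
    failed = False
except TypeError:
    failed = True

# Decoding first turns the bytes into a str, and the replacement succeeds.
word = raw_word.strip().decode("ascii")
fixed = template.replace("###", word.capitalize(), 1)

print(failed)
print(fixed)
```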

The answer is to explicitly decode the words as soon as you get them. To do that, you have to figure out the right encoding from the HTTP headers. How?

In this case, reading the headers shows that the page is ASCII, and it's obviously a static page, so:

    for Word in urlopen(Word_URL).readlines():
        WORDS.append(Word.strip().decode('ascii'))

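But in real life, you usually need to write code that reads the headers and figures out the encoding dynamically. Or, better, install a higher-level library like requests, which does that for you automatically.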
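One way to read the encoding from the headers with only the standard library is sketched below. The helper names `decode_words` and `load_words` and the UTF-8 fallback are assumptions made for illustration; the lookup itself relies on `get_content_charset()`, which the response's header object inherits from `email.message.Message`. The demonstration at the bottom builds a header object by hand so that it runs offline.

```python
from email.message import Message
from urllib.request import urlopen


def decode_words(raw_lines, charset):
    """Strip and decode each bytes line using the given charset."""
    return [line.strip().decode(charset) for line in raw_lines]


def load_words(url, default_charset="utf-8"):
    """Download a word list, decoding with the charset the server declares."""
    with urlopen(url) as response:
        # get_content_charset() returns None when the Content-Type header
        # declares no charset, so fall back to an assumed default.
        charset = response.headers.get_content_charset() or default_charset
        return decode_words(response.readlines(), charset)


# Offline illustration of the header lookup, using a hand-built header object.
headers = Message()
headers["Content-Type"] = "text/plain; charset=ascii"
charset = headers.get_content_charset()
words = decode_words([b"apple\n", b"banana\n"], charset)
print(charset, words)
```

In the script from the question, a call such as `WORDS = load_words(Word_URL)` could then replace the loading loop.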

rajkrish06 · Answer

Explicitly convert the bytes-type 'Word' to a string:

    result = result.replace("###", str(Word, "utf-8"), 1)

That should work.
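One caveat when converting with str(): the result depends on whether an encoding is passed. A small sketch, using a hard-coded bytes value and assuming UTF-8:

```python
raw_word = b"apple"

# Without an encoding, str() returns the printable representation of the
# bytes object, including the b prefix and the quotes.
as_repr = str(raw_word)

# With an encoding, str() decodes the bytes; this is equivalent to
# raw_word.decode("utf-8").
as_text = str(raw_word, "utf-8")

print(as_repr)
print(as_text)
```

Only the second form gives text that is suitable for substituting into the snippets.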