Pandas辞書のリストへのDataFrame

Question

次のDataFrameがあります。

 顧客item1 item2 item3 1 Appleミルクトマト 2水オレンジポテト 3ジュースマンゴーチップ

行ごとの辞書のリストに変換したい

rows = [{'customer': 1, 'item1': 'Apple', 'item2': 'milk', 'item3': 'tomato'}, {'customer': 2, 'item1': 'water', 'item2': 'orange', 'item3': 'potato'}, {'customer': 3, 'item1': 'juice', 'item2': 'mango', 'item3': 'chips'}]

ComputerFellow · Accepted Answer

編集

John Galtが his answer で言及しているように、おそらく代わりにdf.to_dict('records')を使用する必要があります。手動で移調するよりも高速です。

In [20]: timeit df.T.to_dict().values() 1000 loops, best of 3: 395 µs per loop In [21]: timeit df.to_dict('records') 10000 loops, best of 3: 53 µs per loop

元の答え

以下のようにdf.T.to_dict().values()を使用します。

In [1]: df Out[1]: customer item1 item2 item3 0 1 Apple milk tomato 1 2 water orange potato 2 3 juice mango chips In [2]: df.T.to_dict().values() Out[2]: [{'customer': 1.0, 'item1': 'Apple', 'item2': 'milk', 'item3': 'tomato'}, {'customer': 2.0, 'item1': 'water', 'item2': 'orange', 'item3': 'potato'}, {'customer': 3.0, 'item1': 'juice', 'item2': 'mango', 'item3': 'chips'}]

Zero · Answer

df.to_dict('records') を使用すると、外部で転置することなく出力が得られます。

In [2]: df.to_dict('records') Out[2]: [{'customer': 1L, 'item1': 'Apple', 'item2': 'milk', 'item3': 'tomato'}, {'customer': 2L, 'item1': 'water', 'item2': 'orange', 'item3': 'potato'}, {'customer': 3L, 'item1': 'juice', 'item2': 'mango', 'item3': 'chips'}]

Hossain Muctadir · Answer

John Galt's answerの拡張として-

次のDataFrameの場合、

 customer item1 item2 item3 0 1 Apple milk tomato 1 2 water orange potato 2 3 juice mango chips

インデックス値を含む辞書のリストを取得したい場合、次のようなことができます。

df.to_dict('index')

親辞書のキーがインデックス値である辞書の辞書を出力します。この特定のケースでは、

{0: {'customer': 1, 'item1': 'Apple', 'item2': 'milk', 'item3': 'tomato'}, 1: {'customer': 2, 'item1': 'water', 'item2': 'orange', 'item3': 'potato'}, 2: {'customer': 3, 'item1': 'juice', 'item2': 'mango', 'item3': 'chips'}}