如何更改此資料的形狀,使主題列成為包含所有唯一值的頂行標題。Name 和 Surnames 作為唯一值列在第 1 列和第 2 列中。然后在每個單元格中,我需要判斷此人是否屬于該學科類別。我需要轉置或重塑資料,但這到底是如何用代碼完成的?
SUBJECT NAME SURNAME
Art person1 Surname1
Art person2 surname2
Art person3 Surname3
Art person4 Surname4
Art person5 Surname5
Art person6 Surname6
Art person7 Surname7
Art person8 Surname8
DT person1 Surname1
DT person3 Surname3
DT person5 Surname5
Photography person1 Surname1
Photography person2 surname2
Photography person3 Surname3
Photography person5 Surname5
Photography person8 Surname8
Games person4 Surname4
Games person5 Surname5
Games person6 Surname6
Games person7 Surname7
Games person8 Surname8
Games person9 Surname9
所以它看起來像這樣:
Name Surname Art DT Photography Games
person1 Surname1 True False True etc....
person2 surname2 False True False etc...
person3 Surname3
person4 Surname4
person5 Surname5
person6 Surname6
person7 Surname7
person8 Surname8
person9 Surname9
uj5u.com熱心網友回復:
利用:
(df.value_counts().unstack(0)
.notnull()
.reindex(columns=df['SUBJECT'].unique())
.reset_index()
.rename_axis(columns=None))
輸出:
NAME SURNAME Art DT Photography Games
0 person1 Surname1 True True True False
1 person2 surname2 True False True False
2 person3 Surname3 True True True False
3 person4 Surname4 True False False True
4 person5 Surname5 True True True True
5 person6 Surname6 True False False True
6 person7 Surname7 True False False True
7 person8 Surname8 True False True True
8 person9 Surname9 False False False True
uj5u.com熱心網友回復:
這是一個crosstab
轉換為布林值:
out = (pd
.crosstab([df['NAME'], df['SURNAME']], df['SUBJECT'])
.astype(bool)
.reset_index().rename_axis(columns=None)
)
輸出:
NAME SURNAME Art DT Games Photography
0 person1 Surname1 True True False True
1 person2 surname2 True False False True
2 person3 Surname3 True True False True
3 person4 Surname4 True False True False
4 person5 Surname5 True True True True
5 person6 Surname6 True False True False
6 person7 Surname7 True False True False
7 person8 Surname8 True False True True
8 person9 Surname9 False False True False
轉載請註明出處,本文鏈接:https://www.uj5u.com/qukuanlian/536352.html
標籤:Python擅长熊猫
下一篇:添加新列時如何自動填充公式