250x250
Link
๋‚˜์˜ GitHub Contribution ๊ทธ๋ž˜ํ”„
Loading data ...
Notice
Recent Posts
Recent Comments
๊ด€๋ฆฌ ๋ฉ”๋‰ด

Data Science LAB

[Python] Pandas๋ฅผ ์ด์šฉํ•œ ์‹œ๊ฐํ™” ๋ณธ๋ฌธ

๐Ÿ Python/์‹œ๊ฐํ™”

[Python] Pandas๋ฅผ ์ด์šฉํ•œ ์‹œ๊ฐํ™”

ใ…… ใ…œ ใ…” ใ…‡ 2022. 9. 21. 16:22
728x90

https://datascienceschool.net/01%20python/05.05%20%ED%8C%90%EB%8B%A4%EC%8A%A4%EC%9D%98%20%EC%8B%9C%EA%B0%81%ED%99%94%20%EA%B8%B0%EB%8A%A5.html

 

Pandas์˜ ์‹œ๊ฐํ™” ๊ธฐ๋Šฅ — ๋ฐ์ดํ„ฐ ์‚ฌ์ด์–ธ์Šค ์Šค์ฟจ

Pandas์˜ ์‹œ๊ฐํ™” ๊ธฐ๋Šฅ Pandas์˜ ์‹œ๋ฆฌ์ฆˆ๋‚˜ ๋ฐ์ดํ„ฐํ”„๋ ˆ์ž„์€ plot์ด๋ผ๋Š” ์‹œ๊ฐํ™” ๋ฉ”์„œ๋“œ๋ฅผ ๋‚ด์žฅํ•˜๊ณ  ์žˆ๋‹ค. plot์€ matplotlib๋ฅผ ๋‚ด๋ถ€์—์„œ ์ž„ํฌํŠธํ•˜์—ฌ ์‚ฌ์šฉํ•œ๋‹ค. np.random.seed(0) df1 = pd.DataFrame(np.random.randn(100, 3),

datascienceschool.net

import pandas as pd

np.random.seed(0)
df1 = pd.DataFrame(np.random.randn(100,3),
                   index=pd.date_range('1/1/2018', periods=100),
                   columns=['A',"B","C"]).cumsum()
df1.tail()

 

 

f1.plot()
plt.title('Pandas์˜ Plot ๋ฉ”์†Œ๋“œ ์‚ฌ์šฉ ์˜ˆ')
plt.xlabel('์‹œ๊ฐ„')
plt.ylabel('Data')
plt.show()

 

 

 

iris = sns.load_dataset('iris')
titanic = sns.load_dataset('titanic')

iris.sepal_length[:20].plot(kind='bar', rot=0)
plt.title('๊ฝƒ๋ฐ›์นจ์˜ ๊ธธ์ด ์‹œ๊ฐํ™”')
plt.xlabel('data')
plt.ylabel('๊ฝƒ๋ฐ›์นจ์˜ ๊ธธ์ด')
plt.show()

 

 

 

iris[:5].plot.bar(rot=0)
plt.title('Iris ๋ฐ์ดํ„ฐ์˜ Bar Plot')
plt.xlabel('Data')
plt.ylabel('๊ฐ Feature ๊ฐ’')
plt.ylim(0,7)
plt.show()

 

 

 

# ๊ฐ ๋ถ“๊ฝƒ์ข…์˜ ํŠน์ง•๊ฐ’์˜ ํ‰๊ท 
df2 = iris.groupby(iris.species).mean()
df2.columns.name='feature'
df2

 

 

 

 

df2.plot.bar(rot=0)
plt.title('๊ฐ ์ข…์˜ Feature๋ณ„ ํ‰๊ท ')
plt.xlabel('ํ‰๊ท ')
plt.ylabel('์ข…')
plt.ylim(0,8)
plt.show()

 

 

 

# ์ „์น˜์—ฐ์‚ฐ์œผ๋กœ ์‹œ๊ฐํ™”๋ฐฉ๋ฒ•์„ ๋‹ค๋ฅด๊ฒŒ ํ‘œํ˜„
df2.T.plot.bar(rot=0)
plt.title('๊ฐ Feature์˜ ์ข… ๋ณ„ ํ‰๊ท ')
plt.xlabel('Feature')
plt.ylabel('ํ‰๊ท ')
plt.show()

 

 

 

df3 = titanic.pclass.value_counts()
df3.plot.pie(autopct='%.2f%%')
plt.title('์„ ์‹ค๋ณ„ ์Šน๊ฐ ์ˆ˜ ๋น„์œจ')
plt.axis('equal')
plt.show()

 

 

 

 

iris.plot.hist()
plt.title('๊ฐ Feature ๊ฐ’๋“ค์˜ ๋นˆ๋„์ˆ˜ Histogram')
plt.xlabel('๋ฐ์ดํ„ฐ ๊ฐ’')
plt.show()

 

 

 

 

iris.plot.box()
plt.title('๊ฐ Feature ๊ฐ’๋“ค์˜ ๋นˆ๋„์ˆ˜์— ๋Œ€ํ•œ Box Plot')
plt.xlabel('Feature')
plt.ylabel('๋ฐ์ดํ„ฐ ๊ฐ’')
plt.show()

728x90
Comments