Statistics/Concepts(+codes)

๐œ’2 distribution + One-Sample ๐œ’2 test

metamong 2022. 5. 2.

๐Ÿ˜ฝ ์ €๋ฒˆ ์‹œ๊ฐ„์— parametric vs. non-parametric ์ฐจ์ด์— ๋Œ€ํ•ด์„œ ๋ฐฐ์› ๋‹ค.

๐Ÿ˜ฝ ์ด๋ฒˆ ํฌ์ŠคํŒ…์—๋Š” non-parametric test๋ฅผ ์ฒ˜์Œ์œผ๋กœ ๋ฐฐ์›Œ๋ณด๋ ค ํ•จ! ์นด์ด์ œ๊ณฑ ๊ฒ€์ • ์ฒซ๋ฒˆ์งธ test - ์ ํ•ฉ๋„ ๊ฒ€์ •์— ๋Œ€ํ•ด์„œ ์•Œ์•„๋ณดZA! ๐Ÿคฉ

(๋…๋ฆฝ์„ฑ ๊ฒ€์ •์€ ๋‹ค์Œ ์‹œ๊ฐ„์—)

 

๐Ÿคฒ ๋ฒ”์ฃผํ˜• ๋ณ€์ˆ˜๋ฅผ statistical test์— ํ™œ์šฉํ•˜๋Š” ๊ฒฝ์šฐ ์นด์ด์ œ๊ณฑ๊ฒ€์ •์ด ๋งŽ์ด ์‚ฌ์šฉ๋œ๋‹ค!

๐Ÿคฒ ์นด์ด์ œ๊ณฑ์€ ํฌ๊ฒŒ ๋‘ ๊ฐ€์ง€๋กœ ๋‚˜๋ˆˆ๋‹ค.

 

โ‘  ์ ํ•ฉ๋„ ๊ฒ€์ • - Goodness of Fit Test = 'ํ•˜๋‚˜์˜ ๋ฒ”์ฃผํ˜• ๋ณ€์ˆ˜์— ๋Œ€ํ•ด ๊ฐ ๋ฒ”์ฃผ๋ณ„ ํ™•๋ฅ ์— ๊ด€ํ•œ ๊ฒ€์ •'

 

<One-Sample>

 

→ compare multiple observed proportions to expected probabilities

→ ์˜ˆ) ์–ด๋Š ์ง€์—ญ์˜ ์ฃผ๊ฑฐ ํ˜•ํƒœ๊ฐ€ ์•„ํŒŒํŠธ, ์˜คํ”ผ์Šคํ…”, ์ฃผํƒ ์ด๋ ‡๊ฒŒ ์„ธ ์ข…๋ฅ˜๊ฐ€ ์žˆ๋‹ค๊ณ  ํ•œ๋‹ค๋ฉด ๊ฐ๊ฐ ์–ด๋Š ์ •๋„์˜ ๋น„์ค‘์„ ์ฐจ์ง€ํ•˜๋Š” ๊ฐ€? ๊ฐ€์„ค๋กœ ๊ฐ๊ฐ์˜ ๋น„์ค‘์„ ์„ธ์šฐ๊ณ  ์ ํ•ฉํ•œ ์ง€, ๋ฒ”์ฃผ๋ณ„ ์ƒ๋Œ€์  ๋นˆ๋„๋ฅผ ์ด์šฉํ•ด ์„ธ์šด ๊ฒƒ์ด ์ ์ ˆํ•œ ์ง€?

 

โ‘ก ๋…๋ฆฝ์„ฑ ๊ฒ€์ • - Test of Independence = '์„œ๋กœ ๋‹ค๋ฅธ ๋‘ ๋ฒ”์ฃผํ˜• ๋ณ€์ˆ˜ ๊ฐ„์— ์—ฐ๊ด€์„ฑ์ด ์žˆ๋Š” ์ง€' (์ ํ•ฉ๋„๋ณด๋‹ค ๋” ๋งŽ์ด ์“ฐ์ž„!)

(๋‹ค์Œ ํฌ์ŠคํŒ…)

 

<Two-Samples (Independent)>

 

→ evaluate the association btw two categorical variables

→ ์˜ˆ) ๋ฒ”์ฃผํ˜• ๋ณ€์ˆ˜๊ฐ€ 2๊ฐœ ์ฃผ์–ด์ง€๊ณ  ์„œ๋กœ ๊ฐ„์˜ ์—ฐ๊ด€์„ฑ์„ ๋”ฐ์ง„๋‹ค - ์˜ˆ๋กœ ์ˆ˜ํ•™๊ณผ๋ชฉ์˜ ํ•™์ ๊ณผ ํ†ต๊ณ„๊ณผ๋ชฉ์˜ ํ•™์ ์ด ์„œ๋กœ ์—ฐ๊ด€์„ฑ์ด ์žˆ๋Š” ์ง€ ๊ฒ€์ •

* ๐œ’2 distribution

๐Ÿ‘ถ ๋จผ์ € ์นด์ด์ œ๊ณฑ๋ถ„ํฌ๋ฅผ ์•Œ์•„๋ณด์ž

โ‘  ํ‘œ์ค€์ •๊ทœ๋ถ„ํฌ(Z1, Z2, Z3, ... Zk)๋ฅผ ๋”ฐ๋ฅด๋Š” ์—ฌ๋Ÿฌ ๊ฐœ์˜ k๊ฐœ์˜ Z๊ฐ€ ์žˆ์„ ๋•Œ, ๊ฐ ๋ถ„ํฌ๋Š” ์„œ๋กœ ๋…๋ฆฝ์ด๊ณ  ๋žœ๋คํ•˜๊ฒŒ ๊ฐ ๋ถ„ํฌ๋ณ„๋กœ ํ•˜๋‚˜์”ฉ ์ž„์˜์˜ Z๋ฅผ ๋ฝ‘๊ณ  ๊ฐ๊ฐ ์ œ๊ณฑํ•ด์„œ ์„œ๋กœ ๋”ํ•œ๋‹ค - ์ œ๊ณฑํ•ฉ

โ‘ก ๋˜ ๊ฐ ํ‘œ์ค€์ •๊ทœ๋ถ„ํฌ์—์„œ ๋žœ๋คํ•˜๊ฒŒ ๋‹ค๋ฅด๊ฒŒ ๋ฝ‘์•„ ์ œ๊ณฑํ•ฉ์„ ๋˜ ๊ตฌํ•œ๋‹ค

โ‘ข ์œ„ โ‘ก๋ฅผ ๋ฌด์ˆ˜ํžˆ ๋ฐ˜๋ณตํ•ด, ์ œ๊ณฑํ•ฉ๋“ค์˜ ๋ถ„ํฌ๋ฅผ '์นด์ด์ œ๊ณฑ๋ถ„ํฌ'๋ผ๊ณ  ๋ถ€๋ฅธ๋‹ค

 

๐Ÿ‘ถ ์นด์ด์ œ๊ณฑ๋ถ„ํฌ์—์„œ ํ•„์š”ํ•œ ๋ชจ์ˆ˜๋Š” k (์ž์œ ๋„; ํ‘œ์ค€์ •๊ทœ๋ถ„ํฌ ๋”ฐ๋ฅด๋Š” ๋ณ€์ˆ˜๊ฐ€ ๋ช‡ ๊ฐœ๊ฐ€ ์ œ๊ณฑํ•ด์„œ ๋”ํ•ด์กŒ๋Š” ์ง€์˜ ์ˆ˜์น˜)

โ‰ซ ์ฆ‰ k๋งŒ ์•Œ๋ฉด x์— ๋Œ€ํ•œ ๋ถ„ํฌํ•จ์ˆ˜๋กœ ์ •์˜ํ•  ์ˆ˜ ์žˆ๋‹ค.

 

๐Ÿ‘ถ ๋Œ€์ฒด์ ์œผ๋กœ ์นด์ด์ œ๊ณฑ ๋ถ„ํฌ๋Š” ์˜ค๋ฅธ๊ผฌ๋ฆฌ๊ฐ€ ๊ธด ํ˜•ํƒœ๋ฅผ ๊ฐ€์ง

 

- ์นด์ด์ œ๊ณฑ๋ถ„ํฌ ์‹ & ๊ฐœํ˜•(์ž์œ ๋„ 3) -

 

 

๐Ÿ‘ถ ํ‰๊ท ์€ ์ž์œ ๋„ k, ๋ถ„์‚ฐ์€ 2k (์ž์œ ๋„๋ฅผ ๋‘ ๋ฐฐ ๊ณฑํ•จ)

 

๐Ÿ‘ถ ์ž์œ ๋„๊ฐ€ ์ฆ๊ฐ€ํ•˜๋ฉด?

์ž์œ ๋„๊ฐ€ ์ฆ๊ฐ€ํ•œ๋‹ค๋Š” ๊ฑด, ์ œ๊ณฑํ•ฉ์„ ๊ตฌํ•˜๋Š” ๋ฐ ์žˆ์–ด์„œ ํ‘œ์ค€์ •๊ทœ๋ถ„ํฌ์˜ ์ˆ˜๋ฅผ ๋Š˜๋ฆฐ๋‹ค๋Š” ๊ฒƒ๊ณผ ๊ฐ™์€ ๋ง์ด๋‹ค. ์ด ๋•Œ ํ‰๊ท  & ๋ถ„์‚ฐ ์‹์— ์˜ํ•ด ์˜ค๋ฅธ์ชฝ์œผ๋กœ ์ด๋™ํ•˜๋ฉด์„œ ๋ณ€๋™์„ฑ์ด ๋” ๋„“์–ด์ง€๋Š” ๊ทธ๋ž˜ํ”„ ๊ฐœํ˜•์œผ๋กœ ๋ฐ”๋€Œ๊ฒŒ ๋œ๋‹ค

 

๐Ÿ‘ถ ์ฆ‰! ๋‹ค์‹œ ์ •๋ฆฌํ•˜๋ฉด, ์นด์ด์ œ๊ณฑ๋ถ„ํฌํ™•๋ฅ ๋ฐ€๋„ํ•จ์ˆ˜ ๊ฐœํ˜•์€ ์˜ค๋ฅธ๊ผฌ๋ฆฌ๊ฐ€ ๊ธธ๊ฒŒ ๋Š˜์–ด์ง„ ๋น„๋Œ€์นญ ํ˜•ํƒœ์ด๋ฉฐ, ๋ถ„ํฌ ์ˆ˜๊ฐ€ ๋Š˜์–ด๋‚ ์ˆ˜๋ก ์ ์  ๋Œ€์นญ์— ๊ฐ€๊นŒ์šด ํ˜•ํƒœ๋ฅผ ๋ณด์ธ๋‹ค

 

๐Ÿ‘ถ ๋ถ„์œ„์ˆ˜) α๋ผ๊ณ  ํ•  ๋•Œ ์˜ค๋ฅธ๊ผฌ๋ฆฌ ๊ฐœํ˜•์ด๋ฏ€๋กœ P[X>c] = α๋ฅผ ๋งŒ์กฑํ•˜๋Š” X์˜ (1-α) ๋ถ„์œ„์ˆ˜ c๋ฅผ ๐œ’2 α,k(์นด์ด์ œ๊ณฑ ๊ฒ€์ •ํ†ต๊ณ„๋Ÿ‰)๋ผ๊ณ  ํ‘œ๊ธฐํ•œ๋‹ค

(์นด์ด์ œ๊ณฑ๋ถ„ํฌ๋Š” ๋Œ€์นญ์ด ์•„๋‹ˆ๋ฏ€๋กœ 0์„ ๊ธฐ์ค€์œผ๋กœ ๋ถ€ํ˜ธ๊ฐ€ ๋‹ค๋ฅด๊ณ  ๊ฐ™์€ ๊ฐ’์„ ๊ฐ€์ง€๋Š” ๊ฒฝ์šฐ๊ฐ€ ์—†๋‹ค. ๋”ฐ๋ผ์„œ ํ‘œ๋‚˜ ์†Œํ”„ํŠธ์›จ์–ด๋ฅผ ํ†ตํ•ด ๋”ฐ๋กœ ์ฐพ์•„์•ผ ํ•จ)

 

- ์ž์œ ๋„์— ๋”ฐ๋ฅธ ์นด์ด์ œ๊ณฑ๊ทธ๋ž˜ํ”„ ๊ฐœํ˜• & ๋ถ„์œ„์ˆ˜ -

 

1> Goodness of Fit (One Sample ๐œ’2 test)

1> ๊ฐ€์„ค๊ฒ€์ •

H0) ์ฃผ์–ด์ง„ data์˜ ๋ถ„ํฌ๊ฐ€ ์˜ˆ์ƒ๋˜๋Š” ๋ถ„ํฌ(expected frequencies)์™€ ๋™์ผํ•œ ๋ถ„ํฌ๋ฅผ ๋‚˜ํƒ€๋‚ธ๋‹ค

Ha) ์ฃผ์–ด์ง„ data์˜ ๋ถ„ํฌ๊ฐ€ ์˜ˆ์ƒ๋˜๋Š” ๋ถ„ํฌ์™€ ๋™์ผํ•˜์ง€ ์•Š๋‹ค (์ฆ‰, ๋น„๋ชจ์ˆ˜์  ๋ฐฉ๋ฒ•์ด๊ธฐ์— ์ •๊ทœ๋ถ„ํฌ ๊ฐ€์ •์— ๋Œ€ํ•œ ๋ถ„ํฌ ๊ฐ€์ •์ด ์–ธ๊ธ‰ x. ๋งค์šฐ ๋‹ค์–‘ํ•œ ๋ถ„ํฌ์˜ data์— ๋Œ€ํ•ด ๋ชจ๋‘ ๊ท ๋“ฑ ๋ถ„ํฌ์— ๊ด€ํ•œ ์ ํ•ฉ์„ฑ ํŒ์ •์„ ๋‚ด๋ฆด ์ˆ˜ ์žˆ๋Š” ๊ฒƒ์ด๋‹ค.)

(*์˜ˆ์ƒ๋˜๋Š” ๋ถ„ํฌ๋Š” ์ „์ฒด data์˜ ํ‰๊ท ์œผ๋กœ ๊ตฌ์„ฑ๋œ ๋ถ„ํฌ๋ฅผ ๋งŽ์ด ์‚ฌ์šฉ)

 

2> ์นด์ด์ œ๊ณฑ๋ถ„ํฌ - CV ๊ตฌํ•˜๊ธฐ

→ ์œ„ ๋ถ„์œ„์ˆ˜ ์‚ฌ์ง„์—์„œ α๋กœ ์น ํ•œ ๋ถ€๋ถ„์„ critical region(๋„“์ด๊ฐ€ α)์ด๋ผ ๋ถ€๋ฅด๋ฉฐ rejectํ•˜๋Š” ๊ณณ์ด๋‹ค

→ df(n-1 ์ž์œ ๋„)์™€ ์ฃผ์–ด์ง„ α(area to the Right of the CV)๋ฅผ ์ด์šฉํ•ด CV ๊ตฌํ•˜๊ธฐ (์นด์ด์ œ๊ณฑ ๊ฒ€์ •ํ†ต๊ณ„๋Ÿ‰)

 

* ์˜ˆ์‹œ >

- α(0.1)์™€ ์ž์œ ๋„๋ฅผ ์ด์šฉํ•ด ์นด์ด์ œ๊ณฑ ๊ฒ€์ •ํ†ต๊ณ„๋Ÿ‰ ๊ฐ’์ด 9.236์ธ ๊ฑธ ์•Œ์•„๋ƒ„ -

- ๊ทธ๋ฆฌ๊ณ  ์ด ๊ฐ’๋ณด๋‹ค ์ ์€ ๊ฐ’์ด ๋‚˜์˜ค๋ฉด reject region์ด ์•„๋‹ˆ๋ฏ€๋กœ H0์„ ์ฑ„ํƒ! -

(์•„๋ž˜ ๊ทธ๋ฆผ์—์„œ๋Š” ์นด์ด์ œ๊ณฑ ๊ฐ’์ด 5.6์ด ๋‚˜์™”์œผ๋ฏ€๋กœ H0 ์ฑ„ํƒ)

 

 

3> ์นด์ด์ œ๊ณฑ statistics ๊ตฌํ•˜๊ธฐ

→ ์ด์   ์ฃผ์–ด์ง„ CV๋ณด๋‹ค ์นด์ด์ œ๊ณฑ statistics๊ฐ€ ํฐ ์ง€ ์ž‘์€ ์ง€ ํ™•์ธํ•ด์•ผ ํ•œ๋‹ค

(โ€ป ์—ฌ๊ธฐ์„œ ์นด์ด์ œ๊ณฑ statistics๋Š” ์ˆ˜์น˜์— ๋”ฐ๋ผ ๋ณ€ํ•˜๋Š” ๊ฐ’์ด๋‹ค. ์ฆ‰, ์ˆ˜์น˜๊ฐ€ ์ „์ฒด์ ์œผ๋กœ 10๋ฐฐ๋งŒ ์ปค์ ธ๋„ ๊ฒ€์ •ํ†ต๊ณ„๋Ÿ‰ ์ˆ˜์น˜ ์ž์ฒด๊ฐ€ ์ปค์ง„๋‹ค. ๋”ฐ๋ผ์„œ ์šฐ๋ฆฌ๋Š” ๋Œ€์†Œ ๋น„๊ต(p-value)๋ฅผ ํ†ตํ•ด ๊ฐ€์„ค ๊ฒ€์ • ์—ฌ๋ถ€๋ฅผ ๋”ฐ์งˆ ์ˆ˜ ์žˆ๋‹ค)

 

- ์•„๋ž˜ ์‹์œผ๋กœ ์นด์ด์ œ๊ณฑ ํ†ต๊ณ„๋Ÿ‰์„ ๊ตฌํ•œ๋‹ค -

 

 

4> ๊ฒ€์ • ๊ฒฐ๊ณผ ๋‚ด๋ฆฌ๊ธฐ

โ‘  ์นด์ด์ œ๊ณฑ statistics >= CV์ด๋ฉด? - H0 rejected - Ha ์ฑ„ํƒ - ์ฃผ์–ด์ง„ data์˜ ๋ถ„ํฌ๊ฐ€ ์˜ˆ์ƒ๋˜๋Š” data์˜ ๋ถ„ํฌ์™€ ์ผ์น˜ํ•˜์ง€ ์•Š๋‹ค

โ‘ก ์นด์ด์ œ๊ณฑ statistics < CV์ด๋ฉด? - Ho ์ฑ„ํƒ - ์ฃผ์–ด์ง„ data์˜ ๋ถ„ํฌ๊ฐ€ ์˜ˆ์ƒ๋˜๋Š” data์˜ ๋ถ„ํฌ์™€ ์ผ์น˜ํ•œ๋‹ค

w/code

โ˜… scipy.stats.chisquare docu() โ˜…

https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.chisquare.html

 

scipy.stats.chisquare(f_obs, f_exp=None, ddof=0, axis=0)

 

'Calculate a one-way chi-square test. The chi-square test tests the null hypothesis that the categorical data has the given frequencies.'

 

f_obs๋Š” ์ธก์ •ํ•˜๊ณ  ์‹ถ์€ frequency๋“ค์˜ list๋ฅผ ์ง‘์–ด๋„ฃ์œผ๋ฉด ๋œ๋‹ค

f_exp๋Š” ์ธก์ •์— ์‚ฌ์šฉํ•  ๊ธฐ๋Œ“๊ฐ’ frequency๋“ค์˜ list๋ฅผ ์ง‘์–ด๋„ฃ๋Š”๋‹ค (default๋Š” ๋ชจ๋“  ๊ด€์ธก์น˜๊ฐ€ ๋˜‘๊ฐ™์€ ๊ฐ’์œผ๋กœ ๊ณ ๋ฅด๊ฒŒ ๋ถ„๋ฐฐ๋˜๋Š” ํ‰๊ท ๊ฐ’ list)

ddof๋Š” delta degrees of freedom์œผ๋กœ default๋Š” 0, ์ฆ‰ ์ž์œ ๋„ ๊ณ„์‚ฐ ์‹œ (k-1)๋กœ ๊ณ„์‚ฐ๋จ. ์—ฌ๊ธฐ์„œ p-value๋ฅผ ์œ„ํ•ด ์ถ”๊ฐ€๋กœ ddof๋กœ ์กฐ์ •ํ•ด์ค„ ์ˆ˜ ์žˆ๋Š” ๋ฐ dof = k - 1 -ddof๋กœ ๊ณ„์‚ฐ๋œ๋‹ค

 

โ–’ returns (1) ์นด์ด์ œ๊ณฑ ํ†ต๊ณ„๋Ÿ‰ + (2) p-value โ–’

 

โ–’ ์ฃผ์˜! โ–’

'This test is invalid when the observed or expected frequencies in each category are too small. A typical rule is that all of the observed and expected frequencies should be at least 5. According to (Pearson, Karl. “On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling”, Philosophical Magazine. Series 5. 50 (1900), pp. 157-175.), the total number of samples is recommended to be greater than 13, otherwise exact tests (such as Barnard’s Exact test) should be used because they do not overreject.'

 

→ ์ ์–ด๋„ 13๊ฐœ ๋ณด๋‹ค๋Š” ๋งŽ๊ฒŒ sample๋“ค์ด ์žˆ์–ด์•ผ ํ†ต๊ณ„์ ์œผ๋กœ ์˜๋ฏธ ์žˆ๋Š” ๊ฒฐ๊ณผ๊ฐ€ ๋‚˜์˜จ๋‹ค๊ณ  ํ•จ!

 

์˜ˆ์‹œ>

 

Q. ์ด 15๊ฐœ์˜ ์ˆซ์ž๋กœ ๋‚˜์—ด๋˜์–ด ์žˆ๋Š” ๋‘ ๊ฐœ์˜ list๊ฐ€ ์žˆ๋‹ค๊ณ  ํ•˜์ž. ๊ฐ list๋ณ„ ํ•ด๋‹น 15๊ฐœ์˜ ์ˆซ์ž๋กœ ๋งŒ๋“ค์–ด์ง„ ๋ถ„ํฌ๋Š” ํ•ด๋‹น ์ˆซ์ž๋“ค์˜ ํ‰๊ท ์œผ๋กœ ๋งŒ๋“ค์–ด์ง„ ๊ธฐ๋Œ“๊ฐ’ ๋ถ„ํฌ์™€ ์ผ์น˜ํ•˜๋‹ค๊ณ  ๋งํ•  ์ˆ˜ ์žˆ๋Š” ์ง€(์ฆ‰ ๊ท ๋“ฑํ•œ ๋ถ„ํฌ๋ฅผ ๋„๊ณ  ์žˆ๋Š” ์ง€) ์นด์ด์ œ๊ณฑ๊ฒ€์ •์„ ์ด์šฉํ•ด ๊ฐ๊ฐ ์ ํ•ฉ์„ฑ์„ ๊ฒ€์ •ํ•ด ๋ณด์ž

 

A1) 

 

import numpy as np
from scipy.stats import chisquare 

s_obs = np.array([20, 13, 25, 22, 20, 16, 19, 21, 23, 17, 19, 22, 20, 20, 19, 26, 24, 19, 17, 21])
chisquare(s_obs, axis=None)

#Power_divergenceResult(statistic=9.05955334987593, pvalue=0.9724771417573945)

 

* ๊ฒฐ๊ณผ> ์ฒซ๋ฒˆ์งธ list๋Š” pvalue๊ฐ€ 0.05๋ณด๋‹ค ํฌ๊ฒŒ ์ธก์ •๋˜๋ฏ€๋กœ H0์€ ๊ธฐ๊ฐ๋˜์–ด์„œ๋Š” ์•ˆ๋œ๋‹ค. ๋”ฐ๋ผ์„œ s_obs list์˜ ์ˆซ์ž๋กœ ๋งŒ๋“ค์–ด์ง„ ๋ถ„ํฌ๋Š” ํ‰๊ท ์„ ๋”ฐ๋ฅด๋Š” ๊ธฐ๋Œ“๊ฐ’ ๋ถ„ํฌ์™€ ์ผ์น˜ํ•œ๋‹ค(๊ท ๋“ฑํ•œ ๋ถ„ํฌ๋ฅผ ๋”ฐ๋ฅธ๋‹ค)๊ณ  ํ†ต๊ณ„์  ์œ ์˜์„ฑ ๋ฒ”์œ„ ๋‚ด์—์„œ ์„ค๋ช…ํ•  ์ˆ˜ ์žˆ๋‹ค.

 

A2)

 

s_obs = np.array([3, 2, 1, 5, 9, 2, 5, 5, 8, 2, 85, 76, 82, 82, 80, 79, 83, 81, 80, 78])
chisquare(s_obs)
#Power_divergenceResult(statistic=691.2924528301887, pvalue=2.513563617019893e-134)

 

* ๊ฒฐ๊ณผ> ๋‘๋ฒˆ์งธ lists๋Š” 0.05๋ณด๋‹ค ๋งค์šฐ ์ž‘๊ฒŒ ์ธก์ •๋˜๋ฏ€๋กœ H0์€ ๊ธฐ๊ฐ, ์ฆ‰ ๋‘ ๋ฒˆ์งธ list๋Š” ํ‰๊ท ๊ฐ’์œผ๋กœ ๋งŒ๋“ค์–ด์ง„ ๋ถ„ํฌ์™€ ๋งค์šฐ ๋‹ค๋ฅด๋‹ค๊ณ  ๋งํ•  ์ˆ˜ ์žˆ๋‹ค. ์‹ค์ œ ์œก์•ˆ์œผ๋กœ๋งŒ ๋ณด๋”๋ผ๋„ ๋งค์šฐ ๊ฐ’์ด ์ ์€ ์ˆซ์ž๋“ค๊ณผ ํฐ ์ˆซ์ž๋“ค์ด ๊ทน๋‹จ์ ์œผ๋กœ ์–‘์ธก์— ๋ถ„ํฌํ•ด ํ‰๊ท ๊ฐ’์— ๋ชฐ๋ฆฐ ๋ถ„ํฌ์™€ ํ™•์—ฐํžˆ ๋‹ค๋ฅด๋‹ค(๊ท ๋“ฑํ•œ ๋ถ„ํฌ๋ฅผ ๋”ฐ๋ฅด๊ณ  ์žˆ์ง€ ์•Š๋‹ค)๊ณ  ํŒ๋‹จ ๊ฐ€๋Šฅ!

 

๐Ÿ‘ฉ‍๐Ÿš€ one-sample chisquare test ์™„๋ฃŒ! ๐Ÿ‘ฉ‍๐Ÿš€


* ์ธ๋„ค์ผ ์ถœ์ฒ˜) https://educationalresearchtechniques.com/2014/08/29/chi-square-goodness-of-fit-test/

* ๋‚ด์šฉ ์ผ๋ถ€ ์ถœ์ฒ˜) ProDS(์ดˆ๊ธ‰+์ค‘๊ธ‰)1

* ์ถœ์ฒ˜) https://www.youtube.com/watch?v=HKDqlYSLt68 

'Statistics > Concepts(+codes)' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

f distribution  (0) 2022.05.04
Two-Samples ๐œ’2 test  (0) 2022.05.03
Parametric vs. Non-Parametric Tests  (0) 2022.05.02
Types of Errors in Hypothesis Testing  (0) 2022.04.27
ANOVA & (One-Way ANOVA + w/code)  (1) 2022.04.25

๋Œ“๊ธ€