高級統(tǒng)計(jì)方法_第1頁
高級統(tǒng)計(jì)方法_第2頁
高級統(tǒng)計(jì)方法_第3頁
高級統(tǒng)計(jì)方法_第4頁
高級統(tǒng)計(jì)方法_第5頁
已閱讀5頁,還剩24頁未讀, 繼續(xù)免費(fèi)閱讀

下載本文檔

版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進(jìn)行舉報(bào)或認(rèn)領(lǐng)

文檔簡介

8.abc的代碼截圖

>college-read.csv(℃:/users/lerovo/oonloads/Col1ege.csv")

>flx(college)

>rownames(col1ege)-col1ege[,1]

>fIx(colltgt)

>co11?ge-college[t-l]

>fix(college)

>sumnary(college)

PrivateAppsAcceptEnrollToplOpercTop25percF.undergrad

Length:777Min.:81Min.:72Min.:35Min.:1.00Min.:9.0Min.:139

Class:character1stQu.:7761stQu.:6041stQu.:2421STQu.:15.001stQu.:41.01stQu.:992

Mode:characterMedian:1558Median:1110Median:434Median:23.00Median54.0Median:1707

Mean:3002*ean:2019Mean:780Mean:27.56Kean:55.8Mean:3700

3rdQu.:36243rdQu.:24243rdQu.:9023rdQu.:35.003rdQu.:69.03rdQu.:4005

Max.:480^4wax.:26330Max.:6392Max.:96.OOMax.:100.0Max.:31643

p.underaradoutstateRoon.BoardBooksp<r$onalPhDTerminal

Min.1.0Mln.:2340Min.:1780Mln.:96.0Mln.:250Min.:8.00Mln.:24.0

1stQu.95.01stQu.:73201stQu?:35971stQu.:470.01stQu.:8501stQu.:62.001stQu.:71.0

Median3S3.OMedian:9990Median:4200Median:500.0Median:1200Median:75.00Median:82.0

Mean055.3Mean:10441MeanMean;M9.4Mean;1541Mean;72.66Mean;79.7

3rdQu.967.03rdQu.:129253rdQu.:50503rdQu.:600.03rdQu.:17003rdQU.:85.003rdQu.:92.0

Max.21836.0Max.:21700Max.:8124Max.:2M0.0Max.:68OOMax.:103.00Max.:100.0

S.F.Ratioperc.alumniExpendGrad.Rate

Mln.2.50Mln.:0.00Pin.:3186Mln.:10.00

1stQU.11.501stQU.:13.001stQU.:67511stQu.:53.00

Median13.60Median:21.00median:8377wedUn:65.00

Mean14.09Mean:22.74:9660wean:65.46

3rdQu.:16.503rdQu.:31.003rdQu.:108303rdQu.:78.00

Max.39.80Max.:64.00**ax.:56233Max.:118.00

a.read.csv。運(yùn)行截圖

is數(shù)據(jù)編輯器□X

文件編策幫助

XPrivateAppsAccept

1AbileneChristianUniversityYes16601232

2AdelphiUniversityYes21861924

3AdrianCollegeYes14281097

;4AgnesScottCollegeYes417349

i5AlaskaPacificUniversityYes193146

6AlbertsonCollegeYes587479

7AlbertusMagnusCollegeYes353340

i8AlbionCollegeYes18991720

9AlbrightCollegeYes1038839

;10Alderson-BroaddusCollegeYes582498

11

:AlfredUniversityYes17321425

-12AlleghenyCollegeYes26521900

o13AllentownColl.ofSt.FrancisdeSalesYes1179780

?14AlmaCollegeYes12671080

o15AlvernoCollegeYes494313

16AmericanInternationalCollegeYes14201093

j17AmherstCollegeYes4302992

18AndersonUniversityYes1216908

19AndrewsUniversityYes1130704

b.用fix()函數(shù)觀察數(shù)據(jù)(R為每行每列大學(xué)分配名字)

數(shù)據(jù)編輯器—

Il□X(

t文件編市幫助

A

sPrivateAppsAccept

Y

r1AbileneChristianUniversityYes16601232

u2AdelphiUniversityYes21861924

r3AdrianCollegeYes14281097

a

4AgnesScottCollegeYes417349

5AlaskaPacificUniversityYes19314C

e6AlbertsonCollegeYes587479

7AlbertusMagnusCollegeYes353340

8AlbionCollegeYes18991720

e

09AlbrightCollegeYes1038839

10Alderson-BroaddusCollegeYes582498

11AlfredUniversityYes17321425J

12AlleghenyCollegeYes26521900.R

13AllentownColl.ofSt.FrancisdeSalesYes1179780E

14AlmaCollegeYes126710809

A

15AlvernoCollegeYes494313

A

16AmericanInternationalCollegeYes14201093

A

17AmherstCollegeYes4302992

u

18AndersonUniversityYes1216908

Ct

V

<>D

c.i的summary()函數(shù)截圖和ii.pairs()(匯總信息并對前十列或變量產(chǎn)生散點(diǎn)圖矩

FilesPlotsPackagesHelpViewerPresentationa口

I月Zoom-3Export▼。I/GPublish?

26102610261026102610

.iii..一,_」iiiI.—.iiI._.iiiLL._.ii[

S日二h日匕?胃KJ厘:

曰f7內(nèi)若?刁fRE

iiIIIIiIiIIiiIIillIT11iiI

26102610261026102610

c.iii用plot。函數(shù)產(chǎn)生Outstate對Private變量的沿邊箱線圖

FilesPlotsPackagesHelpViewerPresentation

I以Zoom-SExport▼。I/GPublish?

9

0

0)

0=

。

10

collegeSPrivate

C.iV的代碼和運(yùn)行截圖

Max.:59?suwax.:(>4.uuwax.nax.:i

18.00

>paris(college[,1:10])

Errorinparis(college[.1:10]):couldnotfindfunctionF?e$PictsPackaoesHelpViewerPresenUtion

-parls-/Zo?<nFExport?O,CDu&liCh?

>palrs(col1ege[,i:IU])

Errorinpairs.defau11(col1ege;.1:10]):章效值,救不抵適用

于?pNrs?

>pairs(college[,1:10])

Errorinpalrs.default(col1ege*,1:10]):善效象不能適用

^'pairs'

>college[.l]?as.numeric(factor(col1ege[,1]))

>pairs(college(.1:10])

Errorinplox.new():figuremarginsxoolarge

>college[.1:10]?as.numeric(factor(col1ege[,1:10]))

warningmessage:

inxifrm.data.frame(x):cannotxxfrtidataframes

>college[.l]>as.numeric(factor(college

>pairs(college[,1:5])

Errorinplot.new():figuremarginstoolarge

>pairs(col1ege[.1:10])

>plor(col1egeSPr1vate.collegeSoutstate)

>El1xe-repCNoM,nrov(college))

Errorinnrov(college):couldnotfindfunction"nrov"

>El1te-repC,NO'\nrow(college))

>EliTe[college$Topl0perc>50]?*Yes*'

?Elfa(.Lu?(EliL?)

Errorinas.factoe(Elire):co?ldnotfindfunction'as.faI

ctoe”

>Elite-as.factor(Elite)

>college-daia.frame(col1ege,Elite)

>sumnary(col1egeSEl1te)

NO

777

>plot(collegeSElite.col1egeSOutstate)

>vlew(college)

C.V代碼截圖和運(yùn)行截圖

>par(mfrow=c(2,2))

>hist(col1egeSApps)

>hist(collegeSperc.alumni,col=2)

>hist(college$s.F.Ratio,breaks=10)

>hist(col1egeSexpend,breaks=100)

Errorinhist.default(collegeSexpend,breaks=100):'x'必需

為數(shù)值

>hist(collegeSExpend,breaks=100)

Histogramofcollege$AppsHistogramofcollege$perc.alumni

§

§

A。A

「o

zu

88

n。n

b?b

3a

l

OLL.

U:o

e

。

。

0100003000050000

collegeSAppscollegeSpercalumni

Histogramofcollege$S.F.Ratio

^e

ua

an

n息

bb

aa

JJ

LL

§-

o

010203040

collegeSS.F.Ratio

c.vi

par(mfrow=c(l,2))

plor(col1egeSoutstate,collegeSGrad.Rate)

plor(col1ege$ToplOperc,college$Grad.Rate)

運(yùn)行截圖

00

00

--

0

①①00

J褪

E

P

S-20

S0

①。

699

①①

=

0_一

。。

。

0

7t

a-a-

50001C00020000020406080

college$Outstatecollege$Top10perc

非本州學(xué)生學(xué)費(fèi)越高,畢業(yè)率相對就越高。

從排名前10%的高中班畢業(yè)的新生的畢業(yè)率反而不是很高。

10.(a)a?e的代碼截圖

UWbJ

>?Boston

>pairs(Boston)

>plor(BostonScrim,BostonSage)

>plot(BostonSage,BostonScrim)

>plot(Boston$dis.BostonScrim)

>plot(Boston$rad,BostonScrim)

>plot(Boston$tax,BostonScrim)

>plot(BostonSptratio,BostonScrim)

>par(mfrow=c(1,3))

>hist(Boston$crim[BostonScrim>l],break=25)

Error:unexpectedin"hist(Boston$crim[Boston$crim>l],

break="

>hist(Boston$crim[BostonScrim>l],breaks=25)

>hist(RostonSrax,hrpak*;=?5)

>hist(Boston$ptratio,breaks-25)

>dim(subset(Boston,cham==l))

Errorineval(e,x,parent.frame()):object'cham'notfo

und

>dim(subset(Boston,chas==l))

[1]3514

>median(BostonSptratio)

[1]19.05

>Library(MASS)

>Boston

>?Boston

Boston{MASS}RDocumentation

HousingValuesinSuburbsofBoston

Description

TheBostondataframehas606rowsand14columns

Usage

Boston

Format

Thisdataframecontainsthefollowingcolumns:

crxm

percapitaaimeratebytown

zn

proportionofresidentiallandzonedforlotsover25,000sqft

Indus

proportionofnon-retailbusinessacrespertown,

chas

CharlesRiverdummyvariable(=Iiftraaboundsriver.0othervdse).

nox

nitrogenoxidesconcentration(partsper10million),

rm

averagenumberofroonsperdwelling.

age

proportionofowner-occupiedunitsbuiltpriorto1940

dis

rad

indexofaccessibilitytoradialhighways.

tax

full-valueproperty-taxrateper510,000.

ptratio

pupil-teacherratiobytov/n.

black

1000(BAr0.63,whereBkistheproportionofblacksbytown.

-3ZB.Z

lowerstatusofthepopulation(percent).

medv

medianvalueofowner-occupiedhomesinS1000s

Source

Harrison,DandRubinfelcDL.(1978)Hedonicpricesandthedemandforcleanair.JEnviron.Economics

andManagement5.81-102.

BelsleyDA.Kuh.EandWelsch.RE.(1980)RegressionDiagnosticsIdentifyingInfluentialDataand

SourceoofCollinearity.NewYork:Wiley.

可以看出這個(gè)數(shù)據(jù)有505行,14列。列代表每種數(shù)據(jù)如crim,zn,indus,chas

等,每行就表示不同的具體數(shù)據(jù)。

b.散點(diǎn)圖發(fā)現(xiàn)一共有14*14對關(guān)系,可以看到crim和age呈正比而和dis呈

反比關(guān)系nox和dis呈反比關(guān)系等。crim受其他變量影響變化明顯。

:o

L:L

^H□口□

J二

EEE第E□i

sI

F

BEB6G目k

t

Dh

DD一□

iI

HHH□Bl

LbE

H?HHILH-

EEHLi

n&

BE1HHiE

QCQLa

SiDH

2EDH

BH1

量30

n3

□制Id

an口

EHE

ElH

一E

§^k)J

T

1U

C.發(fā)現(xiàn)crim和年齡有關(guān),年齡偏高,犯罪率也會偏高。距離五個(gè)上班區(qū)域的

加權(quán)平均距離dis越低,高犯罪概率值越密集。

Boston$dis

d.截圖

郊外的犯罪率不會特別高,但是稅率還是蠻高的,超600的很多,師生比也比

較高

istograrnofBoston$crirn[Boston$crinHistogramofBoston$taxHistogramofBoston$ptratio

打-lT-

S-

0204060802004006001416182022

Boston$crim(Boston$crwn>1JBostonStaxBostonSptratio

e&f有35個(gè)郊區(qū)在查爾斯河岸附近,該數(shù)據(jù)集里城鎮(zhèn)師生比的中位數(shù)為19.05

Iaaaa?J

>dim(subset(Boston,chas==l))

[1]3514

>median(BostonSptratio)

[1]19.05

g.(1)業(yè)主自用住房的中位數(shù)最小的波士頓郊區(qū)是第399個(gè)。其他預(yù)測變量

的取值分別是38,35180,0,18.1,0,0.6930,5.4530,100.0,

1.4896,24.000,666.000,20.2000,396.9000,30.5900,5.000.在總體的分布上:

犯罪率高,住宅用地比例低,零售商業(yè)比例高,不靠近河,氮氧化物濃度較

高,住宅房間數(shù)不是很多,年齡很高,距離五個(gè)上班區(qū)域的加權(quán)平均距離較

近,交通發(fā)達(dá),稅率很高,師生比較高,黑人占比很高,

>c(subset(Boston,medv==min(Boston$medv)))

399406

crim38.351867.9208

7n0.00000.0000

indus18.100018.1000

chas0.00000.0000

nox0.69300.6930

rm5.45305.6830

age100.0000100.0000

dis1.48961.4254

rad24.000024.0000

tax666.0000666.0000

ptratio20.200020.2000

black396.9000384.9700

Istat30.590022.9800

medv5.00005.0000

>summary(Boston)

crimznIndus

Min.0.00632Min.0.00Min.0.46

1stQU.0.082051stQU.0.001stQU.5.19

wedian0.25651Median0.00Median9.69

Mean3.61352Mean11.36Mean11.14

3rdQu.3.677083rdQu.12.503rdQu.18.10

Max.88.97620Max.100.00Max.27.74

chasnoxrm

Min.:0.00000Min.:0.3850Min.:3.561

1stQu.:0.000001stQu.:0.44901stQu.:5.886

Median:0.00000Median:0.5380Median:6.208

Mean:0.06917Mean:0.5547Mean:6.285

3rdQu.:0.000003rdQu.:0.62403rdQu.:6.623

Max.:1.00000Max.:0.8710Max.:8.780

agedisrad

Min.:2.90Min.:1.130Min.:1.000

1stQu.:45.021stQu.:2.1001stQu.:4.000

Median;77.50Median;3.207Median;5.OOO

Mean:68.57Mean:3.795Mean:9.549

3rdQu.:94.083rdQu.:5.1883rdQu.:24,000

Max.:100.00Max.:12.127Max.:24,000

taxprrctioblack

Min.:187.0Min.:12.60Min.:0.32

1stQu.279.01stQu.:17.401stQu.:375.38

Median330.0Median:19.05Median:391,44

wean408.2Mean:18.46Mean:356.67

3rdQu.666.03rdQu.:20.203rdQu.:396.23

Max.711.0Max.:22.00Max.:396.90

Istatmedv

Min.1.73Min.:5.00

1stQu.6.951stQu.:17.02

wedian11.36Median:21.20

Mean12.65Mean:22.52

3rdQu.16.953rdQu.:25.OO

Max.37.97Max.:50.00

h.有64個(gè)郊區(qū)居民平均居住房間數(shù)量超過7,有13個(gè)郊區(qū)數(shù)超過8個(gè)房間。

居民平均居住房間數(shù)超過8個(gè)的郊區(qū)特征:犯罪率低,住宅用地比例高,零售

商業(yè)比例較之前別的組的數(shù)據(jù)更合適更合理,河,氮氧化物濃度處于中等水

平,住宅房間數(shù)多,距離五個(gè)上班區(qū)域的加權(quán)平均距離近,交通發(fā)達(dá),稅率

低,師生比例低,黑人占比極高,地位低的人群比例也處于較低狀態(tài)

>dim(subset(Boston,rm>7))

[1]6414

>dim(subset(Boston,rm>8))

[1]1314

>summary(subset(Boston,rm>8))

crimznInduschasnox

Min.:0.02009Min.:0.00Min.:2.680Min.:0.0000Min.:0.4161

1stQu.:0.331471stQU.:0.001stQu.:3.9701stclu.:0.00001stQU.:0.504G

Median:0.52014Median:0.00Median:6.200Median:0.0000Median:0.5070

Mean:0.71879Mean:13.62Mean:7.078Mean:0.1538Mean:0.5392

3rdQu.:0.578343rdQu.:20.003rdQu.:6.2003rd(iu.:0.00003rdQu.:0.6050

Max.:3.47428Max.:95.00Max.:19.580Max.:1.0000Max.:O.718C

rmagedisracitax

Min.:8.034Min.:8.40Min.:1.801Min.2.000Min.:224.0

1stQu.:8.2471stQu.:70.401stQu.:2.2881stQu.5.0001stQU.:264.0

Median:8.297Median:78.30Median:2.894Median7.000Median:307.0

Mean:8.349Mean:71.54Mean:3.430Mean7.462Mean:325.1

3rdQu.:8.3983rdQu.:86.503rdQu.:3.6523rdQu.8.0003rdQu.:3O7.0

Max.:8.780Max.:93.90wax.:8.907Max.24.000Max.:666.0

ptratioblackIstatned\

Min.:13.00Min.:354.6Min.:2.47Min.:;>1.9

1stQu.:14.701stQu.:384.51stQu.:3.321stQu.:4n.7

Median:17.40Median:386.9Median:4.14Median:418.3

Mean:16.36Mean:385.2Mean:4.31Mean:z42

3rdQu.:17.403rdQu.:389.73rdQu.:5.123rdQu.:!)0.0

Max.:20.20Max.:396.9Max.:7.44Max.:)0.0

8.(a)打開auto

>Auto=read.csv("C:/users/lenovo/Documents/TencentFiles/14

59157126/FileRecv/Auto.csv",header=T,na.strings=*'?")

>Auto=na.omit(Auto)

summary(Auto)

mpgcylindersdisplacement

Min.:9.00Min.:3.000Min.:68.0

1stQu.:17.001stQu.:4.0001stQu.:105.0

Median:22.75Median:4.000Median:151.0

Mean:23.45Mean:5.472wean:194.4

3rdQu.:29.003rdQu.:8.0003rdQu.:275,8

Max.:46.60Max.:8.000Max.:455.0

horsepowerweightacceleration

Min.:46.0Min.:1613Min.:8.00

1stQu.:75.01stQu.:22251stQu.:13.78

Median:93.5Median:2804Median:15.50

Mean:104.5Mean:2978Mean:15.54

3rdQu.:126.03rdQu.:36153rdQu.:17.02

Max.:23O.0Max.:5140Max.:24.

溫馨提示

  • 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
  • 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
  • 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
  • 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
  • 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。

最新文檔

評論

0/150

提交評論