版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進行舉報或認領(lǐng)
文檔簡介
DefiningandCollectingDataChapter1ObjectivesInthischapteryoulearn:
Tounderstandissuesthatarisewhendefiningvariables.HowtodefinevariablesHowtocollectdataToidentifydifferentwaystocollectasampleUnderstandthetypesofsurveyerrorsClassifyingVariablesByTypeCategorical(qualitative)variablestakecategoriesastheirvaluessuchas“yes”,“no”,or“blue”,“brown”,“green”.Numerical(quantitative)variableshavevaluesthatrepresentacountedormeasuredquantity.DiscretevariablesarisefromacountingprocessContinuousvariablesarisefromameasuringprocessDCOVAExamplesofTypesofVariablesDCOVAQuestionResponsesVariableTypeDoyouhaveaFacebookprofile?YesorNoCategorical(Qualitative)Howmanytextmessageshaveyousentinthepastthreedays?---------------Numerical(discrete)Howlongdidthemobileappupdatetaketodownload?---------------Numerical(continuous)TypesofVariablesVariablesCategoricalNumerical
DiscreteContinuousExamples:MaritalStatusPoliticalPartyEyeColor
(Definedcategories)Examples:NumberofChildrenDefectsperhour
(Counteditems)Examples:WeightVoltage
(Measuredcharacteristics)DCOVACollectingDataCorrectlyIsACriticalTaskNeedtoavoiddataflawedbybiases,ambiguities,orothertypesoferrors.Resultsfromflaweddatawillbesuspectorinerror.Eventhemostsophisticatedstatisticalmethodsarenotveryusefulwhenthedataisflawed.DCOVADevelopingOperationalDefinitionsIsCrucialToAvoidConfusion/ErrorsAnoperationaldefinitionisaclearandprecisestatementthatprovidesacommonunderstandingofmeaningIntheabsenceofanoperationaldefinitionmiscommunicationsanderrorsarelikelytooccur.Arrivingatoperationaldefinition(s)isakeypartoftheDefinestepofDCOVADCOVAEstablishingABusinessObjectiveFocusesDataCollectionExamplesOfBusinessObjectives:Amarketingresearchanalystneedstoassesstheeffectivenessofanewtelevisionadvertisement.Apharmaceuticalmanufacturerneedstodeterminewhetheranewdrugismoreeffectivethanthosecurrentlyinuse.Anoperationsmanagerwantstomonitoramanufacturingprocesstofindoutwhetherthequalityoftheproductbeingmanufacturedisconformingtocompanystandards.Anauditorwantstoreviewthefinancialtransactionsofacompanyinordertodeterminewhetherthecompanyisincompliancewithgenerallyacceptedaccountingprinciples.DCOVASourcesofDataPrimarySources:ThedatacollectoristheoneusingthedataforanalysisDatafromapoliticalsurveyDatacollectedfromanexperimentObserveddataSecondarySources:ThepersonperformingdataanalysisisnotthedatacollectorAnalyzingcensusdataExaminingdatafromprintjournalsordatapublishedontheinternet.DCOVASourcesofdatafallintofivecategoriesDatadistributedbyanorganizationoranindividualTheoutcomesofadesignedexperimentTheresponsesfromasurveyTheresultsofconductinganobservationalstudyDatacollectedbyongoingbusinessactivitiesDCOVAExamplesOfDataDistributedByOrganizationsorIndividualsFinancialdataonacompanyprovidedbyinvestmentservices.Industryormarketdatafrommarketresearchfirmsandtradeassociations.Stockprices,weatherconditions,andsportsstatisticsindailynewspapers.DCOVAExamplesofDataFromADesignedExperimentConsumertestingofdifferentversionsofaproducttohelpdeterminewhichproductshouldbepursuedfurther.Materialtestingtodeterminewhichsupplier’smaterialshouldbeusedinaproduct.Markettestingonalternativeproductpromotionstodeterminewhichpromotiontousemorebroadly.DCOVAExamplesofSurveyDataAsurveyaskingpeoplewhichlaundrydetergenthasthebeststain-removingabilitiesPoliticalpollsofregisteredvotersduringpoliticalcampaigns.Peoplebeingsurveyedtodeterminetheirsatisfactionwitharecentproductorserviceexperience.DCOVAExamplesofDataCollectedFromObservationalStudiesMarketresearchersutilizingfocusgroupstoelicitunstructuredresponsestoopen-endedquestions.Measuringthetimeittakesforcustomerstobeservedinafastfoodestablishment.Measuringthevolumeoftrafficthroughanintersectiontodetermineifsomeformofadvertisingattheintersectionisjustified.DCOVAExamplesofDataCollectedFromOngoingBusinessActivitiesAbankstudiesyearsoffinancialtransactionstohelpthemidentifypatternsoffraud.EconomistsutilizedataonsearchesdoneviaGoogletohelpforecastfutureeconomicconditions.Marketingcompaniesusetrackingdatatoevaluatetheeffectivenessofawebsite.DCOVADataIsCollectedFromEitherAPopulationorASamplePOPULATIONApopulationconsistsofalltheitemsorindividualsaboutwhichyouwanttodrawaconclusion.Thepopulationisthe“l(fā)argegroup”SAMPLEAsampleistheportionofapopulationselectedforanalysis.Thesampleisthe“smallgroup”DCOVAPopulationvs.SamplePopulationSampleAlltheitemsorindividualsaboutwhichyouwanttodrawconclusion(s)AportionofthepopulationofitemsorindividualsDCOVACollectingDataViaSamplingIsUsedWhenSelectingASampleIsLesstimeconsumingthanselectingeveryiteminthepopulation.Lesscostlythanselectingeveryiteminthepopulation.Lesscumbersomeandmorepracticalthananalyzingtheentirepopulation.DCOVAThingsToConsider/DealWithInPotentialSourcesOfDataIsthesourceofdatastructuredorunstructured?Howiselectronicdataformatted?Howisdataencoded?DCOVAStructuredDataFollowsAnOrganizingPrinciple&UnstructuredDataDoesNotAStockTickerProvidesStructuredData:Thestocktickerrepeatedlyreportsacompanyname,thenumberofshareslasttraded,thebidprice,andthepercentchangeinthestockprice.Duetotheirinherentstructure,datafromtablesandformsarestructureddata.E-mailsfromfivepeopleconcerningstocktradesisanexampleofunstructureddata.Inthesee-mailsyoucannotcountontheinformationbeingsharedinaspecificorderorformat.ThisbookdealsexclusivelywithstructureddataDCOVAAllOfTheMethodsInThisBookDealWithStructuredDataTousethetechniquesinthisbookonunstructureddatayouneedtoconverttheunstructuredintostructureddata.Formanyofthequestionsyoumightwanttoanswer,thestartingpointcan/willbetabulardata.DCOVADataCanBeFormattedand/orEncodedInMoreThanOneWaySomeelectronicformatsaremorereadilyusablethanothers.Differentencodingscanimpacttheprecisionofnumericalvariablesandcanalsoimpactdatacompatibility.Asyouidentifyandchoosesourcesofdatayouneedtoconsider/dealwiththeseissuesDCOVADataCleaningIsOftenANecessaryActivityWhenCollectingDataOftenfind“irregularities”inthedataTypographicalordataentryerrorsValuesthatareimpossibleorundefinedMissingvaluesOutliersWhenfoundtheseirregularitiesshouldbereviewed/addressedBothExcel&MinitabcanbeusedtoaddressirregularitiesDCOVAAfterCollectionItIsOftenHelpfulToRecodeSomeVariablesRecodingavariablecaneithersupplementorreplacetheoriginalvariable.Recodingacategoricalvariableinvolvesredefiningcategories.Recodingaquantitativevariableinvolveschangingthisvariableintoacategoricalvariable.Whenrecodingbesurethatthenewcategoriesaremutuallyexclusive(categoriesdonotoverlap)andcollectivelyexhaustive(categoriescoverallpossiblevalues).DCOVAASamplingProcessBeginsWithASamplingFrameThesamplingframeisalistingofitemsthatmakeupthepopulationFramesaredatasourcessuchaspopulationlists,directories,ormapsInaccurateorbiasedresultscanresultifaframeexcludescertainportionsofthepopulationUsingdifferentframestogeneratedatacanleadtodissimilarconclusionsDCOVATypesofSamplesSamplesNon-ProbabilitySamplesJudgmentProbabilitySamplesSimpleRandomSystematicStratifiedClusterConvenienceDCOVATypesofSamples:
NonprobabilitySampleInanonprobabilitysample,itemsincludedarechosenwithoutregardtotheirprobabilityofoccurrence.Inconveniencesampling,itemsareselectedbasedonlyonthefactthattheyareeasy,inexpensive,orconvenienttosample.Inajudgmentsample,yougettheopinionsofpre-selectedexpertsinthesubjectmatter.
DCOVATypesofSamples:
ProbabilitySampleInaprobabilitysample,itemsinthesamplearechosenonthebasisofknownprobabilities.ProbabilitySamplesSimple
RandomSystematicStratifiedClusterDCOVAProbabilitySample:
SimpleRandomSampleEveryindividualoritemfromtheframehasanequalchanceofbeingselectedSelectionmaybewithreplacement(selectedindividualisreturnedtoframeforpossiblereselection)orwithoutreplacement(selectedindividualisn’treturnedtotheframe).Samplesobtainedfromtableofrandomnumbersorcomputerrandomnumbergenerators.DCOVASelectingaSimpleRandomSampleUsingARandomNumberTableSamplingFrameForPopulationWith850ItemsItemNameItem#BevR. 001UlanX. 002. .. .. .. .JoannP. 849PaulF. 850PortionOfARandomNumberTable492808892435779002838116307275111000234012860746979664489439098932399720048494208887208401TheFirst5ItemsinasimplerandomsampleItem#492Item#808Item#892--doesnotexistsoignoreItem#435Item#779Item#002DCOVADecideonsamplesize:nDivideframeofNindividualsintogroupsofkindividuals:k=N/nRandomlyselectoneindividualfromthe1stgroupSelecteverykthindividualthereafterProbabilitySample:
SystematicSampleN=40n=4k=10FirstGroupDCOVAProbabilitySample:
StratifiedSampleDividepopulationintotwoormoresubgroups(calledstrata)accordingtosomecommoncharacteristicAsimplerandomsampleisselectedfromeachsubgroup,withsamplesizesproportionaltostratasizesSamplesfromsubgroupsarecombinedintooneThisisacommontechniquewhensamplingpopulationofvoters,stratifyingacrossracialorsocio-economiclines.PopulationDividedinto4strataDCOVAProbabilitySample
ClusterSamplePopulationisdividedintoseveral“clusters,”eachrepresentativeofthepopulationAsimplerandomsampleofclustersisselectedAllitemsintheselectedclusterscanbeused,oritemscanbechosenfromaclusterusinganotherprobabilitysamplingtechniqueAcommonapplicationofclustersamplinginvolveselectionexitpolls,wherecertainelectiondistrictsareselectedandsampled.Populationdividedinto16clusters.RandomlyselectedclustersforsampleDCOVAProbabilitySample:
ComparingSamplingMethodsSimplerandomsampleandSystematicsampleSimpletouseMaynotbeagoodrepresentationofthepopulation’sunderlyingcharacteristicsStratifiedsampleEnsuresre
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準確性、安全性和完整性, 同時也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。
最新文檔
- 流通環(huán)節(jié)培訓(xùn)材料
- 流行舞舞蹈培訓(xùn)課件
- 流程的培訓(xùn)教學(xué)課件
- 流感相關(guān)知識培訓(xùn)
- 2024-2025學(xué)年陜西省部分學(xué)校高二下學(xué)期5月月考歷史試題(解析版)
- 2024-2025學(xué)年山東省日照市高一下學(xué)期期中考試歷史試題(解析版)
- 2024-2025學(xué)年江蘇省淮安市協(xié)作體高二下學(xué)期期中考試歷史試題(解析版)
- 2026年企業(yè)環(huán)保責(zé)任與ISO14001環(huán)境管理體系模擬自測題
- 2026年企業(yè)培訓(xùn)師考試企業(yè)內(nèi)訓(xùn)技能及人力資源開發(fā)利用題目訓(xùn)練
- 2026年現(xiàn)代物流管理與實務(wù)操作題庫
- 左心耳封堵術(shù)課件
- 中醫(yī)醫(yī)院針灸進修總結(jié)
- 主動脈瘤護理查房
- 招聘費用預(yù)算及方案(3篇)
- 湖南省2025年中考歷史真題試卷及答案
- 癲癇患者急救護理
- 2025公務(wù)員能源局面試題目及答案
- T/CCIAS 009-2023減鹽醬油
- 云南省曲靖市2024-2025學(xué)年高三年級第二次教學(xué)質(zhì)量監(jiān)測思想政治試卷(含答案)
- 名著導(dǎo)讀《經(jīng)典常談》整部書章節(jié)內(nèi)容概覽
- 公司6S管理手冊
評論
0/150
提交評論