雨課堂學(xué)堂在線(xiàn)學(xué)堂云《Big Data Analysis》單元測(cè)試考核答案_第1頁(yè)
雨課堂學(xué)堂在線(xiàn)學(xué)堂云《Big Data Analysis》單元測(cè)試考核答案_第2頁(yè)
雨課堂學(xué)堂在線(xiàn)學(xué)堂云《Big Data Analysis》單元測(cè)試考核答案_第3頁(yè)
雨課堂學(xué)堂在線(xiàn)學(xué)堂云《Big Data Analysis》單元測(cè)試考核答案_第4頁(yè)
雨課堂學(xué)堂在線(xiàn)學(xué)堂云《Big Data Analysis》單元測(cè)試考核答案_第5頁(yè)
已閱讀5頁(yè),還剩41頁(yè)未讀, 繼續(xù)免費(fèi)閱讀

下載本文檔

版權(quán)說(shuō)明:本文檔由用戶(hù)提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)

文檔簡(jiǎn)介

注:不含主觀(guān)題第1題單選題(1分)WhydowesaydataislikeCrudeoil?Whichisnotthereason?()AItisvaluableBItneedstoberefinedCOnedatasetcanbeadaptedtobeusedfordifferentpurposeDItcanbesold第2題單選題(1分)TheModelofGenerating/ConsumingDatahasChangedinto()AFewcompaniesaregeneratingdata,allothersareconsumingdata.BAllofusaregeneratingdata,andallofusareconsumingdata.CSomecompaniesaregeneratingdata,someareconsumingdata.DSomeofusaregeneratingdata,andSomeofusareconsumingdata.第3題單選題(1分)AboutBigdataterm,whichdescriptionisnotsuitable()ABigdatacanbeanalyzedforinsightsofbetterdecisionsandstrategicbusinessmovesBJustlargeCBothstructuredandunstructuredDHard-to-managevolumesofdata第4題單選題(1分)Aboutthedatagenerationstages,whichsequenceiscorrect()AOperationandbusinesssystem,Perceptionstage,User-generatedcontentBOperationandbusinesssystem,User-generatedcontent,PerceptionstageCPerceptionstage,Operationandbusinesssystem,User-generatedcontentDPerceptionstage,User-generatedcontent,Operationandbusinesssystem第5題單選題(1分)Whichofthefollowingstageisthemainreasonofbigdata?()AOperationandbusinesssystemBUser-generatedcontentCPerceptionstageDsocialmediaTest1-2第1題單選題(1分)AccordingtoGartner,thereisestimated20%dataoforganizationis()data,theothermajorityis()data.()Astructured,unstructuredBunstructured,structuredCstructured,semi-structuredDunstructured,semi-structured第2題單選題(1分)Aboutstructureddata,comparedwithunstructureddata,whichdescriptionisNOTright?()AItisnormallyintheformoftablewithrowandcolumn.BItisOrganizedthedatainpredefinedformat.CItiseasytoprocessed.DItrequiresmorestorage.第3題單選題(1分)Aboutunstructureddata,comparedwithstructureddata,whichdescriptionisNOTright?()AItcannotbedisplayedinrows,columnsandrelationaldatabase.BTheyarenormallyimages,audio,video,wordprocessingfiles,e-mails,spreadsheets.CTheyrequiremorestoragebecausetheyarehugeamountandarenotwellorganized.DItiseasytomanageandprotectwithlegacysolutions,inthetraditionalway.第4題單選題(1分)Comparingthedatabaseandbigdata,whichonefirsthasschema,thenorganizethedataaccordingtotheschema.()ADatabaseBDatabaseandbigdataCBigdataDNoneoftheDatabaseandbigdataADatabaseBDatabaseandbigdataCBigdataDNoneoftheDatabaseandbigdata第5題單選題(1分)Thecorrectorderofthedatascaleincreasingis()AKBMBGBPBTBEBBKBMBGBTBPBEBCKBMBTBGBPBEBDKBMBGBTBEBPB第6題判斷題(1分)wecanfindonekindoftooltodealwithallthedatamanageproblemsoftheBigdata.()第7題判斷題(1分)wecanfindonekindoftooltodealwithallthedatamanageproblemsoftheDatabase.()Test1-3第1題單選題(1分)WhichdescriptionisnotsureaboutJimGray()ARelationaldatabasefounderBNauticalsportenthusiastCDividedscientificresearchintofourtypesofparadigmsDBigdatascientist第2題單選題(1分)ThecorrectChronologicallyorderofthefourParadigmsis()AEmpirical–Theoretical–Computational-DataexplorationBTheoretical-Empirical-Computational-DataexplorationCEmpirical-Computational-Theoretical-DataexplorationDEmpirical-Theoretical-Dataexploration-ComputationalTest1-4第1題單選題(1分)AmongtheBigdatacharacters,whichoneismostimportant.()AVolume&VelocityBVarietyCVeracityDValue第2題單選題(1分)WhichofthefollowingbigdatacharactersbestdescribesDataatRest?()AVolumeBVelocityCVeracityDValue第3題單選題(1分)WhichofthefollowingbigdatacharactersbestdescribesDatainMotion?()AVolumeBVarietyCVeracityDVelocity第4題單選題(1分)WhichofthefollowingbigdatacharactersbestdescribesDatainManyForms?()AVolumeBVarietyCVeracityDVelocity第5題單選題(1分)WhichofthefollowingbigdatacharactersbestdescribesDatainDoubt(whichmeansUncertaintyduetodatainconsistencyandincompleteness,ambiguities,latency,deception,modelapproximations)?()AVolumeBVarietyCVeracityDVelocity第6題單選題(1分)WhichofthefollowingbigdatacharacterslikePanningforgoldinthesand?()AValueBVarietyCVeracityDVelocityTest1-5第1題單選題(1分)Thecorrectbigdatalifecycleis()Adatagovernancedatacollecting,datastoringanddataanalyzingBdatacollecting,datagovernance,datastoringanddataanalyzingCdatacollecting,datastoring,datagovernanceanddataanalyzingDdatacollecting,datastoring,dataanalyzinganddatagovernance第2題單選題(1分)Thereducingriskofdecisionmakingsequenceis()AData,Information,Wisdom,KnowledgeBInformation,Data,Knowledge,WisdomCData,Information,Knowledge,WisdomDInformation,Data,Wisdom,Knowledge第3題單選題(1分)Amongthefollowingwhichoneisaboutindividualfacts,figures,signals,measurements?()ADataBInformationCWisdomDKnowledge第4題單選題(1分)Amongthefollowingwhichoneisaboutorganized,structured,categorized,useful,condensed,calculateddata?()ADataBInformationCWisdomDKnowledge第5題單選題(1分)Amongthefollowingwhichoneisaboutidea,learning,notion,concept,synthesized,compared,thought-out,discussed?()ADataBInformationCWisdomDKnowledge第6題單選題(1分)Amongthefollowingwhichoneisaboutunderstanding,integration,applied,reflectedupon,actionable,accumulated,principles,patterns,decision-makingprogress?()ADataBInformationCWisdomDKnowledge第7題單選題(1分)Thehistoryprogressofharnessingdataisthat(

)1)(

)reportingandhumananalysiscanbemadeonhistoricaldata2)(

)cananalyzethecurrentdatatoimprovebusinesstransaction3)(

)Real-TimeAnalyticsProcessingtomaketheRealtimedecisionandimproveRealtimebusinessresponseAOLAP:OnlineAnalyticalProcessing;OLTP:OnlineTransactionProcessing;RTAP:Real-TimeAnalyticsProcessing;BOLTP:OnlineTransactionProcessing;OLAP:OnlineAnalyticalProcessing;RTAP:Real-TimeAnalyticsProcessing;COLAP:OnlineAnalyticalProcessing;RTAP:Real-TimeAnalyticsProcessing;OLTP:OnlineTransactionProcessing;DOLTP:OnlineTransactionProcessing;RTAP:Real-TimeAnalyticsProcessing;OLAP:OnlineAnalyticalProcessing;第8題單選題(1分)Businessintelligenceevoledfrombothscaleandspeed,inthefollowingdiagram,whatarethetechnicsinthesquareNo.1234.

(

)

A.1-Datawarehouses,2-In-memoryRDBMS,3-DistributedDataStore,4-RealTime&SingleViewA1-Datawarehouses,2-In-memoryRDBMS,3-DistributedDataStore,4-RealTime&SingleViewB1-In-memoryRDBMS,2-Datawarehouses,3-DistributedDataStore,4-RealTime&SingleViewC1-Datawarehouses,2-DistributedDataStore,3-In-memoryRDBMS,4-RealTime&SingleViewD1-Datawarehouses,2-In-memoryRDBMS,3-RealTime&SingleView,4-DistributedDataStoreTest1-6第1題單選題(1分)Inthefollowingpicture,whataretherighttermsforeachnumber?ADatasources,Datastorage,Datacollection,DataProcessing,DataVisualization,ReportmonitoringBDatasources,Datacollection,Datastorage,DataVisualization,DataProcessing,ReportmonitoringCDatasources,Datacollection,Datastorage,DataProcessing,DataVisualization,ReportmonitoringDDatasources,Datacollection,Datastorage,DataProcessing,Reportmonitoring,DataVisualizationTest1-7第1題單選題(1分)Whenthevolumeofdatagetsbiggerandbigger,anysingletraditionalhigh-performanceservercannotsatisfytherequirement,moreserversareneeded.Whichiscalledexpand()AverticallyBhorizontallyCcentralizedDdistributed第2題單選題(1分)Distributedcomputing‘sideaistousethe()toachievethe()()Aredundancy,reliability;Breliability,redundancy;Credundancy,performance;Dreliability,performance;第3題單選題(1分)Thetwomaincomponentsofbigdataare()and().()ADistributedStorage,DistributedProcessingBDistributedCollection,DistributedProcessingCDistributedCollection,DistributedStorageDDistributedCollection,Distributedapplication第4題單選題(1分)InBigDataGeneralArchitecture,frombottomtoup,threebasiclayersofbigdatacomputingsystemare()ADataprocessingsystem;Datastoragesystem;Dataapplicationsystem;BDatastoragesystem;Dataprocessingsystem;Dataapplicationsystem;CDatacollectionsystem;Dataprocessingsystem;Datastoragesystem;DDatastoragesystem;Dataprocessingsystem;Datavisualizationsystem;第5題單選題(1分)Inbigdatageneralarchitecture,therearefourpartsindatastoragesystem,whichonebestdescribesthem?()ADatacollection,datamodeling,datastorageincludingdistributedfilesystemanddistributedDatabase,UnifiedDataAccessInterfaceBDatacollection,datapreprocessing,datastorageincludingdistributedfilesystemanddistributedDatabase,UnifiedDataAccessInterfaceCDatapreprocessing,datamodeling,datastorageincludingdistributedfilesystemanddistributedDatabase,UnifiedDataAccessInterfaceDDatapreprocessing,datamodeling,distributedfilesystem;distributedDatabase第6題單選題(1分)Inbigdatageneralarchitecture,therearethreepartsindataprocessingsystem,whichonebestdescribesthem?()ADatastorage,Dataprocessingalgorithm,computingengineandplatformBDatastorage,computingmodel,computingengineandplatformCDataprocessingalgorithm,computingmodel,computingengineandplatformDDataprocessingalgorithm,computingengine,platform第7題單選題(1分)Inbigdatageneralarchitecture,theUDAI-UnifiedDataAccessInterfaceisNOTtoaddressthe()issue.()Across-platformBheterogeneousCdistributedcomputingDinconsistency第8題判斷題(1分)Hadoopistheonlybigdataarchitecture.WebCrawler第1題第2題第3題第4題Test2-1第1題單選題(1分)Accordingtoorganizationboundary,dataresourcescanbedividedinto2categories.()Aonlinedataandofflinedata.Borganizationdataandgovernmentdata.Cinternaldataandexternaldata.DsystemdataandIoTdata.第2題單選題(1分)Whenyoucollectdatafrominternetyoushouldbeawareofsomeissues,whichoneisnotincluded()ADifferentITlevelandstructureofdifferentwebsite--nounifiedcollectionmethodBDifferentwebsiteshavedifferentcontrolpolicyofcrawlersCItsauthenticityanddataqualityareinferiortootherdataDwecollectthedifferentformofdataequally.Test2-2第1題單選題(1分)Themostoftenusedinternaldataacquisitiontoolis()ADatawarehouseBETL(Extract,Transform,load)CDataTriggerDIncrementaldataextraction第2題單選題(1分)()issimpleandintuitiveway,itextractsalldataintheentiresourcedatastorageeverytime.()AIncrementalExtractionBFullextractionCTimestampExtractionDTrigger第3題單選題(1分)()extractnewormodifieddatainthedatabasesincethelastextraction,atthesametime,itnormallywouldnothaveabigimpactontherunningbusinesssystem.()AIncrementaldataextractionBFullextractionCTimestampExtractionDTrigger第4題單選題(1分)()EvaluatethechangeddataindataextractionthroughtheDB'sownlog.()ALogcomparisonBTimestampCTriggersDFulltablecomparison第5題單選題(1分)()Addandmodifythetimestampfieldvaluewhileupdatingthecorrespondingrecorddata.Comparingthevalueofthesystemtimeandthetimestamptodecideextractornot.()ALogcomparisonBTimestampCTriggersDFulltablecomparison第6題單選題(1分)CreateatriggeronthedatatableWheneverthesourcetabledatachanges,thechangeddataiswrittentothetemporarytablethroughthecorrespondingtrigger.Theextractionthreadextractsdatafromthetemporarytable.()ALogcomparisonBTimestampCTriggersDFulltablecomparison第7題單選題(1分)In()dataextractionmethod,theETLtoolcreatesanMD5temporarytablewithasimilarstructureforthetabletobeextractedinadvance.ThetemporarytablerecordstheprimarykeyofthesourcetableandtheMD5checkcodecalculatedbasedonthedataofallfields.()ALogcomparisonBTimestampCTriggersDFulltablecomparison第8題單選題(1分)WhichofthefollowingisNOTdatatransformcomponent?()AFieldmappingBDatacalculationCDatasplitDEliminateduplication第9題單選題(1分)WhichoneisNOTthemethodofdataloading?()ASQLstatementstoinsert,updateanddeletedataBFullextractionCBulkCopyProgramDAPITest2-3第1題單選題(1分)WhichoneisnotNetworkbigdatacharacteristics?()AMulti-sourceheterogeneousBHighnoiseCInteractivityDStructured第2題單選題(1分)Webcrawlercrawlingprocessis(B)a)AlistofuniformresourceaddressescalledseedURLanduseitasthelinkentryforcrawling.WhenthecrawlervisitstheseseedURLs,itidentifiesalltheneededlinksonthepageandaddsthemtothequeuetobecrawled.b)PutthealreadydownloadedURLintothecrawledURLlistc)ExtractthenewURLintotheURLqueuetobecrawledandputtheminthetobecrawledURLqueueaccordingtostrategyd)Thewebpagelinksaretakenoutfromthequeuetobecrawled,thenReadURL,dotheDNSresolution,andwebpagesweredownloadintotheDownloadedweblibrary.e)alltheprocesswillenduntilthequeueforcrawlingisempty.A

abcdeBadbceCacbdeDadcbe第3題單選題(1分)Howtodealwithfan-outURLsinseedURLs,whichisthelinksofthelink,whichinvolveswebcrawlercrawlingstrategies.WhichoneisnottheoftenusedCrawlingstrategies()ADepthfirstBBreadthfirstCFirstIn-FirstoutDPartialPageRankStrategy第4題單選題(1分)Inthediagramusingthebreadthfirstcrawlingstrategy,whichoneistherightsequence?()AM1-M2-M5-M8-M6-M3-S7-S4BM1-M2-M3--S4-M5-M6-S7-M8CM1-M2-M5-M6-M8-M3-S7-S4DM1-M2-M5-M6-M3-S7-M8-S4第5題單選題(1分)The(

)strategyassignsthesame"goldcoins"toeachwebpage.WheneverapagePisdownloaded,the"goldcoins"ownedbyPareequallydistributedtothelinkedpagescontainedinthewebpage.Thelinksinthequeuetobecrawledaresortedby"goldcoins"

APageRankBOPICCDepthfirstDBreadthfirst第6題單選題(1分)Thecrawlingtaskishuge,itcan’teasilybedonebyonestand-alonecrawler,soweneedDistributedwebcrawler.Thereare3basicDistributedArchitecturemodel,whichoneisnotoneofthem.()AMaster-slaveBPeertopeerCMixedstructureDHybridTest2-4第1題單選題(1分)WhatisthecontentavailableontheInternet,butthosewebpages,filesorotherhigh-quality,authoritativeinformationthattraditionalsearchenginesareunabletoindexduetotechnicallimitationsorareunwillingtoindexaftercarefulconsideration.()ASurfaceWebBDeepWebCDarkWebDNoneofthem第2題單選題(1分)Whichofthefollowingisnotthefeatureofdeepwebinformation?()AHighlyrelatedtoinformationneeds,marketsandfields.BFastestgrowingnewtypeofinformationontheInternet.CMorethanhalfisstoredinthematicdatabases.DItcanbesearchedbysearchengine.第3題單選題(1分)Deepwebcontentincludes(

)

1Pagesthatarenotreferredtobysearchenginesduetolackofdirectedlinks

2Non-webfilesaccessibleontheweb,suchaspicturefiles,Pdfandworddocuments,etc.

3Adynamicpageobtainedbyqueryingtheback-endonlinedatabasebyfillingintheform.

4Contentthatrequiresregistrationorotherrestrictionstoaccess.A1234B124C123D234第4題單選題(1分)WhichoffollowingdescriptionaboutthesearchinterfaceofdeepwebisNOT

correct(

)AhascomplexinterfacesBsupportsqueriesonseveralattributesCextractscontentsfromdatabasesDeasytofind第5題單選題(1分)WhichonecompletelydescribetheDeepwebdataacquisitionmethodAQueryinterfaceidentificationandfillintheformautomaticallyBParseHTMLformsorperformgrammaticalanalysisonHTMLformstoautomaticallydiscoverdeepwebdataresourcesC

AssociateHTMLformswithspecificfieldstorealizeautomaticfillingofformsDDomain-independentdetection:Iterativelyobtainquerykeywordsfromqueryresultsbasedonsampling,soastoobtainasmanyqueryresultsaspossiblewithfewerqueries.Test3-1第1題單選題(1分)WhichonecanNOThelppreventingdirtydatafromappearing?()AUnifyattributevalueencodingofmultipledatasourcesBGivetheattributenameandattributevalueasclearaspossibleCUseoptionsasmuchaspossibleforkeyvaluesDManuallyfillingintheentry第2題單選題(1分)TasksofDataPre-processingdoesNOTinclude()ADatacleaningBDatatransformationCDatareductionDDatadefinition第3題單選題(1分)Datacleaningtechnologydoesnotinclude()ADatatransformationBCleaningofmissingdataCDeduplicationofdataDPerformanomalydetectiononthedataset第4題單選題(1分)DataReductiondoesnotinclude()ADimensionalityreductionprocessingofhigh-dimensiondataBReducetheamountofdataCRandomdeletesomedataDDataDiscretizationTechnologyTest3-2第1題單選題(1分)Integrityconstraintsbelongstothedataqualitycategory()ASingledataresource,modellevelBSingledataresource,instancelevelCMultipledataresource,modellevelDMultipledataresource,instancelevel第2題單選題(1分)Uniquenessbelongstothedataqualitycategory()ASingledataresource,modellevelBSingledataresource,instancelevelCMultipledataresource,modellevelDMultipledataresource,instancelevel第3題單選題(1分)Attributedependencebelongstothedataqualitycategory()ASingledataresource,modellevelBSingledataresource,instancelevelCMultipledataresource,modellevelDMultipledataresource,instancelevel第4題單選題(1分)Spellingerrorsbelongstothedataqualitycategory()ASingledataresource,modellevelBSingledataresource,instancelevelCMultipledataresource,modellevelDMultipledataresource,instancelevel第5題單選題(1分)Redundantandrepeatedrecordsbelongstothedataqualitycategory()ASingledataresource,modellevelBSingledataresource,instancelevelCMultipledataresource,modellevelDMultipledataresource,instancelevel第6題單選題(1分)Attributevalueconflictbelongstothedataqualitycategory()ASingledataresource,modellevelBSingledataresource,instancelevelCMultipledataresource,modellevelDMultipledataresource,instancelevel第7題單選題(1分)Namingconflict(usingthesamenamefordifferentdataobjectsorusingdifferentnamesforthesamedataobject)belongstothedataqualitycategory()ASingledataresource,modellevelBSingledataresource,instancelevelCMultipledataresource,modellevelDMultipledataresource,instancelevel第8題單選題(1分)Structureconflict(refertodifferentwaystorepresentthesamedataobjectindifferentdatasources)belongstothedataqualitycategory()ASingledataresource,modellevelBSingledataresource,instancelevelCMultipledataresource,modellevelDMultipledataresource,instancelevel第9題單選題(1分)Differentrepresentationsofvaluesbelongstothedataqualitycategory()ASingledataresource,modellevelBSingledataresource,instancelevelCMultipledataresource,modellevelDMultipledataresource,instancelevelTest3-3第1題單選題(1分)WhichofthefollowingisNOToneofthemaindatacleaningtasks?()ARepeatrecordcleaningBMissingvaluecleaningCEliminatingnoisedataDDeletesomeredundantattributeswhenmergingdifferenttables第2題單選題(1分)Howtodeterminewhethertworecordsareduplicates?()AComparetherelatedattributesoftworecordsaccordingtothesimilarityofeachattributeandtheweightoftheattribute.BComparemanuallybytechniciansCComparebyDatabasesupportDNoneofabove第3題單選題(1分)3.Missingvaluesmustbeinferredandaddedby()1)

Ignorethisrecord2)

Usedefault3)

Useattributeaverage4)

Usetheaverageofsimilarsamples5)

PredictthemostlikelyvalueA1234B2345C12345D1345第4題單選題(1分)WhichofthefollowingisNOToneofthemaineliminatingDatanoisemethods?()ABinning/splitbinalgorithmBClusteringAlgorithmCRegressionalgorithmDFunctionalgorithm第5題單選題(1分)Whensmoothingthenoisydata,whichoneisNOTtheusualmethod?ASmoothbyaverageBSmoothbytherandomvalueCSmoothaccordingtotheboundaryvalueDSmoothaccordingtothemedianTest3-4第1題單選題(1分)WhenyouIntegratedatafrommultipledatasourcesintoaconsistentstorage,toensurethedataquality,whichofthefollowingdatapreprocessingtaskthatisNOTnecessarytoperform()APatternmatchingBDataredundancyprocessingCDatavalueconflictsolvingDDatacalculation第2題單選題(1分)Inordertofacilitateefficientanalysis,whicharethenormaldatatransformationjobsyoucando()?

1)DataSmoothing2)DataAggregation

3)Datageneralization

4)DataNormalization5)AttributeconstructionA1234B2345C12345D1345Test3-5第1題單選題(1分)WhichofthefollowingstatementofdatareductionisNOTright?()ADatareduction(subtraction)technologyisusedtohelpobtainacondenseddatasetfromtheoriginalhugedataset,andmakethiscondenseddatasetmaintaintheintegrityoftheoriginaldatasetBDataanalysisonthecondenseddatasetisobviouslyefficienthigher,andtheresultsofanalysisarebasicallythesameasthoseobtainedbyusingtheoriginaldatasetCThetimespentondatareductioncouldexceedor"offset"thetimesavedbyanalysisonthereduceddata.DThedataobtainedbythereductionismuchsmallerthantheoriginaldata,butcanproducethesameoralmostthesameanalysisresults.第2題單選題(1分)WhichofthefollowingisNOTthedimensionalityreduction?()AWavelettransformationBAttributesubsetselectionCPrincipalcomponentanalysisDDataCubeAggregation第3題單選題(1分)WhichofthefollowingisNOTthenumerosityreduction?()APrincipalcomponentanalysisBDataCubeAggregationCClusteringDSampling第4題單選題(1分)1.Whichofthefollowingarethechoicesofattributessubsetselectionmethods?

(C)1)ForwardStepwiseAttributessubsetselection2)BackwardStepwiseAttributessubsetselection3)Combineforwardselectionandbackwarddeletion4)Principalcomponentanalysis5)Reductionbasedonstatisticalanalysis6)Decisiontree(decisiontree)inductionA12346B12345C12356D123456第5題單選題(1分)Whatattributessubsetselectionmethodshowedinthediagram?

AForwardStepwiseAttributessubsetselectionBBackwardStepwiseAttributessubsetselectionCCombineforwardselectionandbackwarddeletionDDecisiontree(decisiontree)induction第6題單選題(1分)Whatattributessubsetselectionmethodshowedinthediagram?AForwardStepwiseAttributessubsetselectionBBackwardStepwiseAttributessubsetselectionCCombineforwardselectionandbackwarddeletionDDecisiontree(decisiontree)induction第7題單選題(1分)AboutthePrincipalcomponentanalysis-PCA,whichstatementiswrong?()APrincipalcomponentanalysissearchestoobtainc-dimensionalorthogonalvectorsthatbestrepresentthedata.BPrincipalcomponentanalysisistheNumerosityreductionmethods.CTheoriginaldatacanbeprojectedintoasmallerspacetoachievedatacompression.DPrincipalcomponentanalysisislossycompression.Test4-1第1題單選題(1分)Datamodelingcouldincludedefining.

(

)1)

Metadata2)

Datastructure3)

Attributes4)

Valuerange5)

Associationrelationship6)

Consistency7)

TimelinessA12345B1234567C134567D123567第2題單選題(1分)Inthedatastoringsystem,itcaninclude4parts,thereasonableprocessingsequencefrombottomtotopcouldbe(

)1)Datacollectionandmodeling2)UnifieddataAccessInterface3)DistributedFileSystem4)DistributedDatabaseandDataWarehouseA1342B1234C1324D1423第3題單選題(1分)Whydoweneeddatamodeling,becauseitcansupport()ADatastoragestructuredesignBDatabasedesignCCalculationmodelDApplicationdesign第4題單選題(1分)Basedontherequirementwecanbuildthebusinessmodel,itincludes()and().()AConceptualModel,LogicModelBLogicModel,PhysicalModelCProcessmodel,DatamodelDProcessmodel,logicalmodel第5題單選題(1分)AboutDataModelingdesignlevelsdescriptions:Whichoneiscorrectmatching?(C)1)Basedontheuser'sdatafunctionrequirements.functionsandassociationrelationshipsareobtained,EntityClasscorrespondingtothebusinesselementsandfunctions.2)Moredetailsofdataentities,includingprimarykeys,foreignkeys,attributes,indexes,relationships,constraints,andevenviews,withdatatables,datacolumns,valueranges,object-orientedclasses,XMLtagsandotherformstodescribe.3)Thestorageimplementationofdata,includingdatapartition,datatablespace,anddataintegration.A1-Conceptualmodeldesign2-physicalmodeldesign3-logicalmodeldesignB1-Logicalmodeldesign

2-Physicalmodeldesign3-ConceptualmodeldesignC1-Conceptualmodeldesign2-logicalmodeldesign3-PhysicalmodeldesignD1-Physicalmodeldesign

2-Conceptualmodeldesign3-logicalmodeldesignTest4-2第1題單選題(1分)InHDFS,thenamenodeandthedatanodehavetheirownresponsibilities,selecttheresponsibilitiesofnamenodeanddatanoderespectively.Namenodes(),

Datanodes()

1)Realizethemappingofdatablockstothelocalfilesystemofthedatanode2)Managefilesystemnamespace3)Storefiledatablock4)Save“filetodatablocktodatanode”mappingrelationship5)Schedulingclientaccesstofiles6)storetheDatablocksonthelocaldisk7)StoretheMetadatainmemoryforquickaccessA1237,456B2457,136C1245,367D2456,137第2題單選題(1分)TherightorderofwritingtotheDatanodesinHDFSis()

a)

DistributedFileSystemmakesanRPCcalltothenamenodetocreateanewfileinthefilesystem’snamespace,withnoblocksassociatedwithit.b)

Theclientcreatesthefilebycallingcreate()methodonDistributedFileSystem.c)

Thelistofdatanodesformsapipeline,anddefaultreplicationlevelisthree,sotherearethreenodesinthepipeline.TheDataStreamerstreamsthepacketstothefirstdatanodeinthepipeline,whichstoresthepacketandforwardsittotheseconddatanodeinthepipeline.d)

Thenamenodeperformsvariouscheckstomakesurethefiledoesn’talreadyexistandtheclienthastherightpermissionstocreatethefile.Ifallthesecheckspass,thenamenodemakesarecordofthenewfile;otherwise,filecreationfailsandtheclientisthrownanIOException.e)

TheDistributedFileSystemreturnsanFSDataOutputStreamfortheclienttostartwritingdatatodatanode.FSDataOutputStreamwrapsaDFSOutputStreamwhichhandlescommunicationwiththedatanodesandnamenode.f)Astheclientwritesdata,DFSOutputStreamsplitsitintopackets,whichitwritestoaninternalqueue,calledthedataqueue.ThedataqueueisconsumedbytheDataStreamer,whichisresponsibleforaskingthenamenodetoallocatenewblocksbypickingalistofsuitabledatanodestostorethereplicas.g)

Similarly,theseconddatanodestoresthepacketandforwardsittothethird(andlast)datanodeinthepipeline.h)

DFSOutputStreamalsomaintainsaninternalqueueofpacketsthatarewaitingtobeacknowledgedbydatanodes,calledtheackqueue.Apacketisremovedfromtheackqueueonlywhenithas

溫馨提示

  • 1. 本站所有資源如無(wú)特殊說(shuō)明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶(hù)所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁(yè)內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒(méi)有圖紙預(yù)覽就沒(méi)有圖紙。
  • 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
  • 5. 人人文庫(kù)網(wǎng)僅提供信息存儲(chǔ)空間,僅對(duì)用戶(hù)上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶(hù)上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
  • 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
  • 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶(hù)因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。

評(píng)論

0/150

提交評(píng)論