版權(quán)說(shuō)明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)
文檔簡(jiǎn)介
1、,Software and Services Group,Hadoop: the Intel Way,(Hadoop的英特爾之道),Bring New Analytics Capabilities to Hadoop Stack,何京翔,英特爾亞太研發(fā)有限公司總經(jīng)理,#,Security & Trust,Workload Consolidation,Cloud and IOT: More Users, More Device, More Data Immersive Experiences Cloud,Connectivity Data Analytics Software and Servi
2、ces Group,Open Cloud Architecture,#,Software and Services Group,Intels Vision,This decade we will create and extend computing technology to connect and enrich the lives of every person on earth,Software and Services Group # 4,Sensor Readin g,Log,Tabl e,Image,Document,Existing IT & Data,RDBM S EDW,Da
3、ta Marts,Systems BI All of Your Big Data (Structured & Unstructured),Our Big Data Goal: Make Hadoop the Foundation of Next-Gen Data Analytics Platform Data Mining and Analytics,Business Intelligenc e,Statistic Modeling,Machine Learning,# 5,HBase,HDFS,Hive,Base Station s,3G,Instantaneous query of 3G
4、records by subscribers Software and Services Group,User,Segmentation MapReduce ETL,Hadoop in Telecom Carrier Network Optimizations,Hive,Instantaneous,query (e.g., road image),Legacy applications,MapReduce HBase Stream processing (e.g., real-time road conditions) Software and Services Group # 6,Hadoo
5、p in Smart City Data mining (e.g., vehicle,tracking),Hadoop的英特爾之道,更易用 (Reduced Complexity) 更高效,企業(yè)級(jí)解決方案 Enterprise-Grade Solution 即時(shí)分析 (Instantaneous Analysis) 英特爾Hadoop發(fā)行版, 穩(wěn)定的企業(yè)級(jí)軟件產(chǎn) 品 針對(duì)垂直行業(yè)的功能 增強(qiáng),前沿技術(shù)開(kāi)發(fā) Advanced Development “Project Panthera”, Advanced development and path-finding Open source and
6、community driven,(Improved Efficiency) Bring New Analytics Capabilities to Hadoop Stack Software and Services Group # 7,英特爾Hadoop發(fā)行版 優(yōu)化的大數(shù)據(jù)處理軟件產(chǎn)品,英特爾,Hadoop Manage r 安裝、部署、 配置、監(jiān)控、 告警和訪問(wèn)控 制,利用硬件新技術(shù)進(jìn)行優(yōu)化 針對(duì)行業(yè)的功能增強(qiáng),應(yīng)對(duì)不同行業(yè)的大數(shù)據(jù)挑戰(zhàn) 數(shù)據(jù)分析、統(tǒng)計(jì)和挖掘,Mahout,機(jī)器學(xué)習(xí),R 數(shù)據(jù)統(tǒng)計(jì),Hive,Pig,數(shù)據(jù)流處理語(yǔ)言,可靠的分布式文件系統(tǒng) Software and Servi
7、ces Group # 8,穩(wěn)定的企業(yè)級(jí)Hadoop發(fā)行版 為Hadoop提供即時(shí)數(shù)據(jù)處理能力 數(shù)據(jù)處理,工具集,from Revolution Analytics 交互式數(shù)據(jù)倉(cāng)庫(kù) MapReduce 穩(wěn)定高效的分布式計(jì)算框架 分布式、高維數(shù)據(jù)庫(kù)HBase HBase 0.94的改進(jìn)和創(chuàng)新,提供即時(shí)數(shù)據(jù)處理 HDFS,Sqoop 關(guān)系數(shù)據(jù)ETL工具 Flume 日志收集工具 Zookeeper 分布式協(xié)作服務(wù),SQL engine for,Hive/MapReduce, Better integration with existing infrastructure using SQL,HBas
8、e, Document,semantics & significantly speedup query processing on,HBase Software and Services Group # 9, Efficient utilization,of new HW platform technologies,“Project Panthera” Open source initiatives to enable advanced analytics capabilities on Hadoop Document store on,#,Software and Services Grou
9、p,即時(shí)分析 (Instantaneous Analysis),10,Instantaneous analysis with greatly enhanced HBase Stream new data into HBase for analysis in real time, Support high update rate workloads (to keep the system always up to,date), Allow very low latency, online data serving Etc.,#,11,Interactive Query on HBase (英特爾
10、Hadoop發(fā)行版) 10X faster than MapReduce For certain queries on HBase (e.g., group-by aggregation),HBase Query Engine Layer, ,Fast, distributed aggregations directly inside HBase Parallel scanning over multiple regions Advanced, distributed filtering (CRC32 comparator, fuzzy row,filter, etc.),HBase Quer
11、y Engine as New Hive,Backend Most “SELECT” automatically optimized to use HBase Query Engine “WHERE” using advanced scanner/filter “GROUP-BY” using distributed,aggregations “JOIN” stills go to MapReduce Software and Services Group,#,12,A Document Store on HBase (“Project Panthera”) Up-to 3x storage
12、reduction and 3x query speedup For Hive/MapReduce query processing on HBase (See and HBASE-6800) DOT (Document Oriented Table) on HBase, ,Each row contains a collection of documents Each document contains a collection of fields A document is mapped to a HBase column and serialized using Avro Complet
13、e transparent to existing HBase applications Software and Services Group,#,Software and Services Group,更易用 (Reduced Complexity),13, Better data mining and statistics capabilities, Full-text indexing and search, Statistic modeling with R language, Better integration with existing infrastructures, Geo
14、-distributed datacenters Full SQL support for OLAP,#,14,Full-Text Indexing and Search (英特爾Hadoop發(fā)行版) Full-text indexing and near real-time search for advanced data mining (E.g., log and click stream analysis, healthcare record analysis, etc.),Incremental full-text indexing on HBase Full-text indexin
15、g for semi-structured data (text, strings, numbers, etc.) Index incrementally built when records inserted or updated Support very high data insertion / update rate,Near real-time search Distributed, keyword or logical expression based search Zero delay of searching latest data that are just inserted
16、 Software and Services Group,#,Software and Services Group,Bring R Statistics into Hadoop,(英特爾Hadoop發(fā)行版),15,Distributed Statistic Modeling on Hadoop using R language,16,Data Center A Virtual Big Table,Cross-Datacenter BigTable/HBase (英特爾Hadoop發(fā)行版) A virtual Big Table overlaid over existing geo-distr
17、ibuted data centers, ,Global table view Data stored in geo-distributed, ,data centers Better locality & higher availability Data transfer eliminated through distributed,aggregation Data Center C Data Center B Async Replication Software and Services Group #,17,An analytical SQL engine for Hive/MapRed
18、uce (“Project Panthera”) Goal: Provide Full SQL support for OLAP in Hadoop Required by business users, enterprise applications, 3rd party tools (e.g., BI applications), etc. (See and HIVE-3472),Hive Parser,Hive-AST,HiveQL,Driver,Query,* Software and Services Group #,(Open Source) SQL Parser*,SQL- AS
19、T,SQL-AST Analyzer & Translator Subquery Multi-Table Unnesting SELECT ,Hive Semantic Analyzer INTERSECT MINUS Support Support,Hadoop MR,SQL,Hive- AST,#,Software and Services Group,更高效 (Improved Efficiency),18, Performance benchmarks & tools, Efficient utilizing of new HW platform technologies (e.g., SSD,infiniband),#,19,英特爾Hadoop發(fā)行版高效支撐海量移動(dòng)上網(wǎng) 記錄分析 聯(lián)通全國(guó)移動(dòng)用戶上網(wǎng)記錄查詢分析系統(tǒng), ,國(guó)內(nèi)首個(gè)基于Hadoop/HBase的商用電信服務(wù) 系統(tǒng) 系統(tǒng)部署 英特爾Hadoop發(fā)行版 滿足高性能的數(shù)據(jù)導(dǎo)入和快速查詢。 穩(wěn)定、易于部署和管理的企業(yè)級(jí)方案。 180+節(jié)點(diǎn)Hadoop/HBase集群 系統(tǒng)性能指標(biāo) 上網(wǎng)記錄入庫(kù)時(shí)間:一般小于30分鐘,實(shí) 際約10分鐘 具備存儲(chǔ)全國(guó)移動(dòng)用戶不小于6個(gè)月的原始 上網(wǎng)記錄能力 統(tǒng)計(jì)分析的中間報(bào)表數(shù)據(jù)保存不小于5年 上網(wǎng)記錄查詢速度:不高于1秒 支持并發(fā)查詢數(shù)目:1000請(qǐng)求/秒 Softwar
溫馨提示
- 1. 本站所有資源如無(wú)特殊說(shuō)明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁(yè)內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒(méi)有圖紙預(yù)覽就沒(méi)有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫(kù)網(wǎng)僅提供信息存儲(chǔ)空間,僅對(duì)用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。
最新文檔
- 2025至2030在線藝術(shù)教育平臺(tái)運(yùn)營(yíng)模式分析及增長(zhǎng)動(dòng)力與戰(zhàn)略投資研究報(bào)告
- 公務(wù)員閬中市委組織部關(guān)于閬中市2025年考調(diào)35人備考題庫(kù)及答案詳解1套
- 二十大安全生產(chǎn)論述課件
- 2025至2030中國(guó)征信智能預(yù)警系統(tǒng)建設(shè)與實(shí)施效果研究報(bào)告
- 2025至2030中國(guó)肉禽行業(yè)自媒體營(yíng)銷(xiāo)效果評(píng)估與流量變現(xiàn)策略研究報(bào)告
- 2026中國(guó)氣動(dòng)鼓泵行業(yè)未來(lái)趨勢(shì)與投資前景預(yù)測(cè)報(bào)告
- 云南中煙工業(yè)有限責(zé)任公司2026年畢業(yè)生招聘啟動(dòng)備考題庫(kù)有答案詳解
- 2025至2030中國(guó)新能源汽車(chē)產(chǎn)業(yè)鏈全景解析及投資機(jī)會(huì)研究報(bào)告
- 2026年西安交通大學(xué)第一附屬醫(yī)院醫(yī)學(xué)影像科招聘勞務(wù)派遣助理護(hù)士備考題庫(kù)帶答案詳解
- 安徽東新產(chǎn)業(yè)服務(wù)有限公司2025年招聘?jìng)淇碱}庫(kù)及參考答案詳解1套
- 散文系列《補(bǔ)鞋子的人》精-品解讀
- 2025國(guó)開(kāi)本科《公共部門(mén)人力資源管理》期末歷年真題(含答案)
- 養(yǎng)老院對(duì)護(hù)工規(guī)范管理制度
- 農(nóng)行內(nèi)控制度匯編
- 2025年企業(yè)黨支部書(shū)記年度述職報(bào)告
- 2026年孝昌縣供水有限公司公開(kāi)招聘正式員工備考題庫(kù)及參考答案詳解1套
- 絕經(jīng)后宮頸上皮內(nèi)病變處理要點(diǎn)2026
- 2025年校長(zhǎng)個(gè)人述職報(bào)告:凝心聚力抓落實(shí) 立德樹(shù)人開(kāi)新局
- 瀝青混凝土面板全庫(kù)盆防滲施工質(zhì)量通病防治手冊(cè)
- 光伏電站故障處理培訓(xùn)大綱
- 設(shè)備維保三級(jí)管理制度
評(píng)論
0/150
提交評(píng)論