面向未來分解式人工智能系統(tǒng)的下一代共封裝光學(xué)器件 Next-generation Co-Packaged Optics for Future Disaggregated AI Systems_第1頁
面向未來分解式人工智能系統(tǒng)的下一代共封裝光學(xué)器件 Next-generation Co-Packaged Optics for Future Disaggregated AI Systems_第2頁
面向未來分解式人工智能系統(tǒng)的下一代共封裝光學(xué)器件 Next-generation Co-Packaged Optics for Future Disaggregated AI Systems_第3頁
面向未來分解式人工智能系統(tǒng)的下一代共封裝光學(xué)器件 Next-generation Co-Packaged Optics for Future Disaggregated AI Systems_第4頁
面向未來分解式人工智能系統(tǒng)的下一代共封裝光學(xué)器件 Next-generation Co-Packaged Optics for Future Disaggregated AI Systems_第5頁
已閱讀5頁,還剩34頁未讀, 繼續(xù)免費閱讀

下載本文檔

版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進(jìn)行舉報或認(rèn)領(lǐng)

文檔簡介

Next-generationCo-Packaged

OpticsforFutureDisaggregatedAISystems

SajjadMoazeni

AssistantProfessor

UniversityofWashington,Seattle

1|

2|

Co-packagedOptics(CPO)

[Broadcom]

Large-scaledata-center

networkingandswitches

&

Riseofdata-intensiveAI/ML

applications

Demandssignificantly

largeroff-packageI/O

bandwidths!

3|

Hyper-scaleData-centers

?ElectricalLinks:Energy/BWdensityscalabilityissues

?Futurerequirements:multi-Tb/s/mm&Sub-5pJ/b

[Broadcom]

SiPhPlatformfor

OpticalI/O

Co-packaged

PICwithDSP

Switchwith

OpticalI/O

202120232025

Co-packagedOpticsfor>51.2Tb/s

4|

DemandedBandwidthsbyAI/ML

AI/MLDemand

DoublingEvery

3.5Month!

Moore’sLaw

[OpenAI]

5|

DemandedBandwidthsbyAI/ML

100x8xH100

[NvidiaDGXSuperpod]

?+10Tb/soff-packagebandwidthswillbesoonrequiredforGPUs

?OpticallyconnectedGPU&NVSwitchesistheonlyviablesolution

?Scalingoutwillbealsoeasierwithopticalinterconnects

6|

DisaggregatedComputeSystems

?Data-center:

–Improvedutilizationanddynamic

resourceallocations

?AI/MLWorkloads:

–DistributedTrainingandParallelism

–NVMe-over-fabric,CXL,…

–DPU,etc.

–Co-packagedOptics(CPO)

[]

7|

Goalsfor

Optics(CPO)

Co-packaged

[NvidiaGTC2022]

[AMDISSCC2023]

?Silicon

Photonics

[NvidiaGTCDec2020]

?Micro-ringresonator(MRM)basedopticaltransceivers(TRx)

[DallyOFC2022]

?Wavelengthdivisionmultiplexing(WDM)

8|

Micro-ringModulator(MRM)

WavelengthWavelength

?Resonancewavelength:λ0=neffL/m,m=1,2,3,...

–Q-factor:Q=λ0/Δλ

?Compactdevice(radiusof5μm)

–Energy&areaefficientmodulator/filter,Onlya20fFload!

?Supportingwavelengthdivisionmultiplexing(WDM)

9

9|

MRMbasedOpticalLinks

?ModulationScheme:

1.Deplete/InjectcarriersusingPNjunctions

10

10|

MRMbasedOpticalLinks

?ModulationScheme:

1.Deplete/InjectcarriersusingPNjunctions

2.ΔfreecarriersΔindexofrefraction[Carrier-PlasmaEffect]

11

11|

MRMbasedOpticalLinks

?ModulationScheme:

1.Deplete/InjectcarriersusingPNjunctions

2.ΔfreecarriersΔindexofrefraction[Carrier-PlasmaEffect]

3.On-OffKeying(OOK)modulation

12

12|

MRMbasedOpticalLinks

?ModulationScheme:

1.Deplete/InjectcarriersusingPNjunctions

2.ΔfreecarriersΔindexofrefraction[Carrier-PlasmaEffect]

3.On-OffKeying(OOK)modulation

*.OMA:OpticalModulationAmplitude

13

13|

MRMbasedOpticalLinks

?ModulationScheme:

1.Deplete/InjectcarriersusingPNjunctions

2.ΔfreecarriersΔindexofrefraction[Carrier-PlasmaEffect]

3.On-OffKeying(OOK)modulation

*.OMA:OpticalModulationAmplitude

14|

Electronic-PhotonicIntegration

?Electronic-PhotonicIntegration

–Monolithic(LowParasitic,OldCMOS)vs.

2.5D/3D(LargeParasitic,AdvancedCMOS)

45nmSOICMOS+Photonics

?Laserintegration:off-packageor

integratedwithsiliconphotonics

[AMD,ISSCC2023]

15|

40Gb/sPAM-4TxinGF45nm

[Moazenietal.,ISSCC/JSSC17]

16|

State-of-the-ArtofCPO

Ranovus+AMD

?TotalBWperfiber:1Tb

?+10pJ/bfor100Gb/sperwavelength

?IntegratedLasers

?0.5Tb/s/mm

AyarLabs+Intel

?TotalBWperfiber:800Gb

?+5pj/bfor20Gb/sperwavelength

?ExternalLaser

?0.5Tb/s/mm

17|

MajorChallenges

?Electronic-PhotonicIntegration

–Monolithicvs.2.5D/3D

–Interconnectparasitics>3xDevicecap

?Energy-efficiency

–Need5-10ximprovement

–Electronic-photonicCo-design

?FiberPackaging

?DWDMLaserSources

[DallyOFC2022]

100Gb/sPAM-4Rx(3.9pJ/b)

[Lietal.,ISSCC21]

40Gb/sPAM-4Tx(0.5pJ/b)[Moazenietal.,ISSCC17]

18|

SolutionsforNext-generationCPO

?Advancedpackagingwithdirectbondinginsteadofmicro-bumps

?Electronic-photonicCo-design(e.g.,OpticalDAC)

Silicon-InterconnectFabric(SiF),UCLA2021

19|

BeyondJustanE/OBridge

[MoazeniOFC2018]

NewarchitectureswillbeunlockedwithCPO…

?Network-level:

–Micro-secondopticalcircuitswitchingnetworks

?Package-level:

–Co-processingontheCPO

–HBMmemoryaccess&controller

20|

CPOswithMemoryController

?BypassingGPUformemoryaccess

–AddCXLmemorycontroller

?ImprovingGPUDirectlatency

UltimateCPO-enabledArchitecture:

[AngLi,PNNL2019]

21|

Conclusion

?Co-packagedOpticscanprovidetheneedsofnextgenerationofGPU/Acceleratorinterconnects

?Next-generationCPOdemands

溫馨提示

  • 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
  • 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
  • 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
  • 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
  • 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。

評論

0/150

提交評論