




已閱讀5頁,還剩7頁未讀, 繼續(xù)免費閱讀
版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進行舉報或認領(lǐng)
文檔簡介
arXiv:1707.01926v1cs.LG6Jul20171GraphConvolutionalRecurrentNeuralNetwork:Data-DrivenTracForecastingYaguangLiRoseYuCyrusShahabiYanLiuDepartmentofComputerScience,UniversityofSouthernCaliforniayaguang,qiyu,shahabi,AbstractSpatiotemporalforecastinghassignicantimplicationsinsustainability,transportationandhealth-caredomain.Tracforecastingisonecanonicalexampleofsuchlearningtask.Thistaskischallengingdueto(1)non-lineartemporaldynamicswithchangingroadconditions,(2)complexspatialdependenciesonroadnetworkstopologyand(3)inherentdicultyoflong-termtimeseriesforecasting.Toaddressthesechallenges,weproposeGraphConvolutionalRecurrentNeuralNetworktoincorporatebothspatialandtemporaldependencyintracow.Wefurtherintegratetheencoder-decoderframeworkandscheduledsamplingtoimprovelong-termforecasting.Whenevaluatedonreal-worldroadnetworktracdata,ourapproachcanaccuratelycapturespatiotemporalcorrelationsandconsistentlyoutperformsstate-of-the-artbaselinesby12%-15%.IntroductionSpatiotemporalforecastingisacrucialtaskforalearningsystemthatoperatesinadynamicenvironment.Accuratespatiotemporalforecastinghasawiderangeofapplicationsrangingfromvideocompressionandunderstanding,astoenergyandsmartgridmanagement,economicsandnance,toenvironmentalandhealthcare.Inthispaper,westudyoneexampleofspatiotemporalforecastingtask:tracforecasting,thecorecomponentoftheintelligenttransportationsystems.Webelieveourapproachisnotlimitedtotransportation,andisreadilyapplicabletootherdomainsaswell.Thegoaloftracforecastingistopredictthefuturespeedsofasensornetworkusingprevioustracspeedsaswellastheunderlyingroadnetworksstructure.Thistaskischallengingmainlyduetothecomplexspatialandtemporaldependencies.Ononehand,tractimeseriesdemonstratestrongtemporaldynamic.Recurringincidentssuchasrushhoursoraccidentscancausenon-stationarybehaviorintracspeeds,leadingtodicultyinlong-termforecasting.Ontheotherhand,multivariatetimeseriesfromasensornetworkcontaincomplexspatialcorrelations.Itisoftenthecasethatsuchspatialcorrelationarehighlylocalized.Figure1showstheweightslearnedfromtheauto-regressivemodelusingweightedaverageforasinglesensorprediction.Thelearnedweightshighlyconcentrateonitscloseneighbors.Anotherimportantcharacteristicsoftracisthe“conservationofow”,whichmeansthenumberofvehiclesinaroadnetworkstaysrelativelythesameduringashorttimeperiod.Intheliterature,tracforecastinghasbeenstudiedfordecades,fallingintotwomaincate-gories:data-drivenapproachandknowledge-drivenapproach.Intransportationandoperationalresearch,knowledge-drivenmethodsusuallyapplyqueuingtheoryandsimulateuserbehaviorsintrac5.Intimeseriescommunity,data-drivenmethodssuchasautoregressiveintegratedmovingaverage(ARIMA)modelandKalmanlteringremainpopular15,14.However,simpletimese-riesmodelsusuallyrelyonthestationarityassumptionofthetimeseries,andhavelimitedcapacity1torepresenthighlynonlineardynamics.Mostrecently,deeplearningmodelsfortracforecastingforecastinghavebeendevelopedin16,28.In10,theauthorsdevelopdeepauto-regressivemodelsformoregeneralspatiotemporalforecastingtask,e.g.,inventoryforecasting.However,thesedeeplearningmodelsonlyapplytounivariatetimeseriesorfocusonshort-termforecasting.Deepneuralnetworkmodelsforthedomainofspatiotemporalforecastingstaylargelyelusive.Ourworkservesasanimportantsteptointegratemanyimportantdevelopmentsindeeprecurrentneuralnetworksintotimeseriesanalysis,particularlyforspa-tiotemporalforecasting.Weleveragerecentadvancesingraphconvolution7,21andsequencemodeling6,3todesigntheGraphConvolutionalRecurrentNeuralNetwork(GCRNN).GCRNNmodelsboththespatialandthetemporaldependenceinthetracnetwork.Specically,weresorttorecurrentneuralnetworktocapturethenon-lineardynamics,andmodifytheGatedRecurrentUnittoincorporatetheunderlyingsensornet-workstructure.Thisisdonethroughtransformationofinputsequencethroughagraphconvolutionalkernel.Figure1:LocalspatialdependencyforToaddresstheerrorpropagationissueinlong-termfore-singlesensorlearnedfromweightedav-castingtask,wefurtherintegratetheencoder-decodererage.Largerweightsindicatehigherframeworkandscheduledsamplingtechnique3.Whencorrelation.evaluatedonthereal-worldtracdata,GCRNNconsistentlyoutperformsstate-of-the-arttracforecastingbaselinesbyalargemargin.Ourcontributionscanbesummarizedasfollows:Weinvestigatedtracforecasting,animportantmultivariatespatiotemporalforecastingtask,andidentieditsuniquespatiotemporaldependencystructure.Weproposedgraphconvolutionrecurrentneuralnetworkasawholisticframeworktoecientlycapturebothspatialandtemporalstructure.Theproposedapproachachievesthebestreportedresultsonreal-worldtracforecastingandobtainedsignicantimprovementoverstate-of-the-artmethods.2RelatedWorkTracforecastingisaclassicproblemintransportationandoperationalresearchwhicharelargelybasedonqueuingtheoryandsimulations9.Data-drivenapproachesfortracforecastinghavereceivedconsiderableattention,detailscanbefoundinarecentsurveypaper25andthereferencestherein.However,existingmachinelearningmodelseitherimposestrongstationaryassumptionsofthedata(e.g.,auto-regressivemodel)orfailtoaccountforhighlynon-lineartemporaldependency(e.g.,latentspacemodel27,8).Recently,deeplearningmodelsdelivernewpromisefortimeseriesforecastingproblem.Forexample,in28,theauthorsstudyunivaritetimeseriesforecastingusingdeepLSTMnetwork.In10,theauthorsproposeaprobabilisticdeepauto-regressiverecurrentframeworktoforecastinventorytimeseriesacrossdierentdomains.Theforecastingproblemwearefacinghereisspatiallycorrelatedtimeseries,whichrequirescarefulmodelingofbothspatialandtemporaldependency.Intermsofgeneralsequencemodeling,RecurrentNeuralNetworks(RNNs)havebecomethestate-of-the-artchoice,leadingtosuccessfulapplicationsinlanguagemodeling2,videogeneration23,speechrecognition17andweathernowcasting26.However,mostexisting2GraphConvolutionalGraphConvolutionalRecurrentLayerRecurrentLayerGraphConvolutionalGraphConvolutionalRecurrentLayerRecurrentLayerInputGraphSignalsPredictions.TimeDelay=1EncoderCopyStatesDecoderFigure2:SystemarchitectureforGraphConvolutionalRecurrentNeuralNetworkdesignedforspatiotemporaltracforecasting.deepsequencemodelsdealswitheitherdiscretetimesequenceorsequencesthatareevenlydistributedoveraregulargrid.Forinstance,convolutionalLSTMnetwork26capturesthespatiotemporalstructureamongpixelsbyapplyingaconvolutionallterovereachframeofthevideostream.Thelanguagesequencesareoftenencodedasdiscretetimeseries.Onthecontrary,timeseriesfromsensornetworksintracforecastingarecontinuoustimesequencesdistributedoveragraph.Closelyrelatedtoourworkisthedeeplearningmodelsfornon-Euclideanstructureddata.Forexample,in20,theauthorsproposeGraphNeuralNetworks(GNN)modelinthevertexspace,whichlearnsnoderepresentationsforthegraph.Lietal.13extendsGNNforsequencemodeling.TheresultingGatedGraphSequenceNeuralnetworkachievesthestate-of-the-artperformanceforprogramverication.Goingfromvertexdomaintospectraldomain,spectralgraphconvolutionalneuralnetworks(GCN)arerstintroducedin4,whichbridgesthespectralgraphtheoryanddeepneuralnetworks.In7,theauthorsfurtherimproveGCNwithfastlocalizedconvolutionslters.OurmodelextendsGCNtomodelmultivariatetimeseriesdistributedonanetwork.OurmodelcoincideswitharecentworkonsequentialgeneralizationofGCN21,however,wefocusoncontinuoustimepredictionandlong-termforecastingbyincorporatingencoder-decoderarchitecture24andscheduledsampling3techniques.3MethodologyWerstformalizethelearningproblemoftracforecastingandidentifyuniquespatiotemporaldependencystructures.Wethenproposeavariationofthedeeprecurrentneuralnetworkmodel.Givenaseriesofroadnetworksnapshots,ourmodeladdressesthreetechnicaldiculties:(1)localizedspatialdependency,(2)temporaldynamicsingraphs,and(3)long-termforecasting.3.1TracForecastingProblemThegoaloftracforecastingistopredictthefuturetracspeedbasedonpreviouslyobservedtracow.Thetracowismeasuredbynspatiotemporalcorrelatedsensorsontheroadnetwork.Thepair-wiserelationshipbetweenthosesensorscanbemodeledasaweightedgraphG=(V,E,A),whereVisanitesetof|V|=nvertices,whileEisasetofedgesandARnnis3Htimesteps,i.e.,Xt+1,Xt+H,whereHistheforecastinghorizon.aweightedadjacencymatrixrepresentingtheconnectivitybetweensensors.Thus,anobservationoftracspeedsatatimetcanbeviewedasagraphsignal,Xt:VRdx,wheredxisthedimensionofsignalineachnode.Thetracforecastingproblemcanbeformulatedasfollows:givensensorgraphGandhistoricaltracmeasurementsofsensors,inferthemostlikelytracmeasurementsinthenextXt+H,Xt+1=argmaxlogP(Xt+H,Xt+1|Xt,XtK+1;G,)Xt+H,Xt+1Supposethateachtimestepis5minutesandHis12,theoutputsofthemodelwillbethetracmeasurementofevery5minutesforallthesensorsinthenexthour.Notethattheaforementionedtracforecastingproblemisdierentfromthesingle-steptimeseriesforecastingproblem.ThepredictiontargetofourproblemisasequenceofmultivariatetimeseriesdistributedoveragraphGwhichcontainsbothspatialandtemporalstructures.Moreover,thepredictionproblemdenedin26canbeconsideredasaspecialcaseofthisproblemwhereGisaregulargrid.3.2SpatialDependencyModelingTractimeseriesfromroadnetworksensorsdemonstratestrongspatialdependency.Itismainlydueto(1)networkconnectivity:highwaynetworksusuallyhavesensorsinstalledevery1-2miles,andtracowofadjacentsensorsarehighlycorrelated;(2)owconservation:thenumberofvehiclesenteringandexitingtheroadsareapproximatelythesame.Unfortunately,recurrentneuralnetworks(RNNs)donotexplicitlymodelsuchspatialdependency.Inthiswork,weaugmentRNNsbyconsideringspatialcorrelationsamongmultivariatetimeseries.GraphAttentionMechanismInRNNs,theactivationisaweightedcombinationofallthehistoricalobservationsandhiddenstates,whilethespatialdependencyoftracisratherlocalized.Inordertoaccountforsuchlocaldependency,wegeneralizetheattentionmechanismfromsequencemodelingtospatialmodeling.Inparticular,weallowthemodeltolearntofocusoncloseneighborhoodsinsteadoftheentirenetwork.Thisisachievedbyrepresentingthehiddenstateofasensorusingacombinationofthehiddenstatesfromnearbysensorsweightedbyattention.Theattentionmechanismisdenedas:fatt(hi,hj)=hiWahj,aij=exp(fatt(hi,hj)knb(i,K)exp(fatt(hi,hk),gi=jnb(i,K)aijhj,(1)wherehidenotesthehiddenstatesofsensoriwhichisextractedusingaRNNsharedacrossallthenodes.nb(i,K)returnsthesetofneighborsthatarewithinK-hopfromnodei,andgirepresentstheaggregatedhiddenstatefornodeithatincorporatesinformationfromneighborhoodnodes.Then,theforecastingtaskofnodeiisimplementedusingafullyconnectedfeedforwardnetworkwithgiastheinput.GraphLaplacianTransformationGraphattentionmechanismenablesexplicitnetworkstructuremodeling,butinpracticeitonlyleadstomarginalperformanceimprovement.Thisispartlybecauseofthedicultyintrainingsuchmodelasitistimeconsumingtocomputepair-wiseattentionforlargenumberofnodes.Anotherreasonisthatgraphattentiononlymodelsthetopologicaldependencyinthevertexdomain,andyetitfailstocapturethe“conservationofow”propertyintrac.WeresolvethisissuebytransformingthetractimeseriesfromvertexdomainintothespectraldomainusinggraphLaplacian.4Figure3:Visualizationofeigen-functions.(a)showssensorlocationsonthemap.(b)and(c)correspondtoeigen-functionswithsmalleigenvalue(lowfrequencyandsmooth)while(d)and(e)correspondtooneswithlargeeigenvalues(highfrequencyandnon-smooth).GraphLaplacianisadiscreteversionofLaplacianoperator,whichcharacterizesthecon-nectivityofthegraph.ApplyingLaplacianoperatory=Lxtothesignalrepresentsone-stepdiusionofthesignalonthegraph.WearguethatitisnaturaltouseGraphLaplacianoperatorfortracforecastingproblems.Ifwemodelthechangeoftracowasxti(t)=jAij(xixj),wehavexti(t)=cLix,whereAijistheelementoftheadjacencymatrixofthegraph,ListhegraphLaplacianandcisaconstant.Thissharessimilarformastheheatequation,whichisgivenbythelaw“conservationofenergyinphysics.Inimageprocessing,thistransformationisknownasgraphconvolutionalkernel,denotedasg.Tracforecastingproblemprovidesanalternativemotivationofperformingsuchtransformation.ToobtaintheLaplacianmatrix,weconstructtheadjacencymatrixbasedonroadnetworkdistancewithathresholdedGaussiankernel22.Figure3visualizestheeigen-functionsofthenormalizedLaplacianmatrixforpartoftheroadnetworkinLosAngeles.Smalleigen-functionsrepresentsmoothspatialdependencywhilelargeonesdenotehighoscillation.WecanmakesomeinterestingobservationsinFigure3,whichcouldhelpexplainthespatialdependencycapturedbyLaplacian.Forexample,in(c)NearUniversalStudiosHollywood,atthecrossingofhighway101,134and170(d)NearRoseBowlStadium,atthecrossingofhighway2and134.Todealwithspatialdependencyatdierentresolutions,wecomputeaweightedsumofkthpowerofLaplacianasthespectraltransformation.ThisisbasedonthefactthatkthpowerofLaplacianissupportedbyexactlyk-hopneighbors22,representingthespreadoftracowatdierentscale.ComputingthekthpowerLaplacianmatrixcanbecomputationallyexpensive,soweapplyChebyshevpolynomialexpansion7forecientapproximation.K1K1K1y=gw(L)x=wkLx=UwkUxwkTk()xtttktkt(2)k=0k=0k=0ofChebyshevcoecientswhileTk()RnnistheChebyshevpolynomialoforderkevaluatedat=2/maxI.ThisapproximationreducesthelteringcomputationalcostfromO(|V|2)wherethegw(L)isthelearnedlterbasedonLaplacianmatrix.parameterwRKisavectortoO(K|E|).3.3TemporalDynamicsModelingWemodelthetemporaldynamicsintheframeworkofrecurrentneuralnetworks.OneofthevariantsofRNNistheGatedRecurrentUnits(GRU)6whichhasasimplerstructureandcompetitiveperformancecomparingwithLSTM.GatedGraphConvolutionWeincorporatespatialdependencyintoGRUbyreplacingthematrixmultiplicationwiththegraphconvolutionGdenedinEquation2.Thisgraphconvolu-5Figure4:Visualizationof24hoursroadnetworktractimeseriesevolutioninspectraldomainwithLaplaciantransformedinput(toprow)andvertexdomainwithrawinput(bottomrow).Spectraldomainenjoysbettersparsity.Theskewnessofthedistributionofthetransformedinputreectsthetraccongestioncondition.tionaloperationisappliedtobothinputsandhiddenstatestoobtainaGraphConvolutionalGatedRecurrentUnit(GCGRU).rt=(WrGxt+UrGht1+br)ut=(WuGxt+UuGht1+bu)ct=tanh(WcGxt+UcG(rtht1)+bc)ht=utht1+(1ut)ctWestackGRUandunrolltherecurrenceforaxednumberofstepsTanduseback-propagationthroughtimeinordertocomputegradients.Figure4showstheroadnetworktracevolutionin24hours,goingthroughmorningrushhourandafternoonrushhour.Wecanseethatinspectraldomain,thetracspeedtimeseriesenjoysbettersparsitythaninthevertexdomain.Thedistributionofthetransformedinputreectthetraccongestioncondition.Withheavycongestioninrushhour,thespectraldistributionofthetimeseriesbecomemoreheavy-tailed.Long-TermForecastingInlong-termforecasting,simplytrainingthemodelforonestepaheadprediction,andthenback-feedingthepredictionsattesttimeispronetoerrorpropagation.Theforecastingerrorinearlierstepscouldbequicklyampliedoverlong-timespan.Wedrawinspirationfromtheencoder-decoderarchitecture24aswellasscheduledsam
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。
最新文檔
- 2025至2030年中國離子刻蝕機市場分析及競爭策略研究報告
- 2025至2030年中國電子鎮(zhèn)流器多點溫度巡檢儀市場分析及競爭策略研究報告
- 2025至2030年中國油壓熱壓整形機市場分析及競爭策略研究報告
- 2025至2030年中國枸杞提取物市場分析及競爭策略研究報告
- 2025至2030年中國射流式深井自動泵市場分析及競爭策略研究報告
- 2025至2030年中國衛(wèi)生消毒用品市場分析及競爭策略研究報告
- 2025至2030年中國乳品加工罐市場分析及競爭策略研究報告
- 機電專業(yè)培訓(xùn)
- 幼兒園消毒液培訓(xùn)
- 角膜白斑護理查房
- 裝飾設(shè)計公司利潤分析報告
- 國際貿(mào)易咨詢服務(wù)合作協(xié)議
- 新生兒吸入性肺炎的觀察與護理
- 醫(yī)院布草洗滌服務(wù)方案(技術(shù)方案)
- 嬰幼兒輔食添加課件
- 單片機課程設(shè)計之超聲波測距-電子工程系單片機課程設(shè)計報告
- 寧騷公共政策學(xué)
- 地下室頂板行車與堆載驗算與加固方案
- GB/T 37234-2018文件鑒定通用規(guī)范
- 高中英語讀后續(xù)寫教學(xué)策略的探究
- 2022年動畫制作行業(yè)分析及未來五至十年行業(yè)發(fā)展報告
評論
0/150
提交評論