




已閱讀5頁,還剩7頁未讀, 繼續(xù)免費閱讀
版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進行舉報或認領(lǐng)
文檔簡介
arXiv:1707.01926v1cs.LG6Jul20171GraphConvolutionalRecurrentNeuralNetwork:Data-DrivenTracForecastingYaguangLiRoseYuCyrusShahabiYanLiuDepartmentofComputerScience,UniversityofSouthernCaliforniayaguang,qiyu,shahabi,AbstractSpatiotemporalforecastinghassignicantimplicationsinsustainability,transportationandhealth-caredomain.Tracforecastingisonecanonicalexampleofsuchlearningtask.Thistaskischallengingdueto(1)non-lineartemporaldynamicswithchangingroadconditions,(2)complexspatialdependenciesonroadnetworkstopologyand(3)inherentdicultyoflong-termtimeseriesforecasting.Toaddressthesechallenges,weproposeGraphConvolutionalRecurrentNeuralNetworktoincorporatebothspatialandtemporaldependencyintracow.Wefurtherintegratetheencoder-decoderframeworkandscheduledsamplingtoimprovelong-termforecasting.Whenevaluatedonreal-worldroadnetworktracdata,ourapproachcanaccuratelycapturespatiotemporalcorrelationsandconsistentlyoutperformsstate-of-the-artbaselinesby12%-15%.IntroductionSpatiotemporalforecastingisacrucialtaskforalearningsystemthatoperatesinadynamicenvironment.Accuratespatiotemporalforecastinghasawiderangeofapplicationsrangingfromvideocompressionandunderstanding,astoenergyandsmartgridmanagement,economicsandnance,toenvironmentalandhealthcare.Inthispaper,westudyoneexampleofspatiotemporalforecastingtask:tracforecasting,thecorecomponentoftheintelligenttransportationsystems.Webelieveourapproachisnotlimitedtotransportation,andisreadilyapplicabletootherdomainsaswell.Thegoaloftracforecastingistopredictthefuturespeedsofasensornetworkusingprevioustracspeedsaswellastheunderlyingroadnetworksstructure.Thistaskischallengingmainlyduetothecomplexspatialandtemporaldependencies.Ononehand,tractimeseriesdemonstratestrongtemporaldynamic.Recurringincidentssuchasrushhoursoraccidentscancausenon-stationarybehaviorintracspeeds,leadingtodicultyinlong-termforecasting.Ontheotherhand,multivariatetimeseriesfromasensornetworkcontaincomplexspatialcorrelations.Itisoftenthecasethatsuchspatialcorrelationarehighlylocalized.Figure1showstheweightslearnedfromtheauto-regressivemodelusingweightedaverageforasinglesensorprediction.Thelearnedweightshighlyconcentrateonitscloseneighbors.Anotherimportantcharacteristicsoftracisthe“conservationofow”,whichmeansthenumberofvehiclesinaroadnetworkstaysrelativelythesameduringashorttimeperiod.Intheliterature,tracforecastinghasbeenstudiedfordecades,fallingintotwomaincate-gories:data-drivenapproachandknowledge-drivenapproach.Intransportationandoperationalresearch,knowledge-drivenmethodsusuallyapplyqueuingtheoryandsimulateuserbehaviorsintrac5.Intimeseriescommunity,data-drivenmethodssuchasautoregressiveintegratedmovingaverage(ARIMA)modelandKalmanlteringremainpopular15,14.However,simpletimese-riesmodelsusuallyrelyonthestationarityassumptionofthetimeseries,andhavelimitedcapacity1torepresenthighlynonlineardynamics.Mostrecently,deeplearningmodelsfortracforecastingforecastinghavebeendevelopedin16,28.In10,theauthorsdevelopdeepauto-regressivemodelsformoregeneralspatiotemporalforecastingtask,e.g.,inventoryforecasting.However,thesedeeplearningmodelsonlyapplytounivariatetimeseriesorfocusonshort-termforecasting.Deepneuralnetworkmodelsforthedomainofspatiotemporalforecastingstaylargelyelusive.Ourworkservesasanimportantsteptointegratemanyimportantdevelopmentsindeeprecurrentneuralnetworksintotimeseriesanalysis,particularlyforspa-tiotemporalforecasting.Weleveragerecentadvancesingraphconvolution7,21andsequencemodeling6,3todesigntheGraphConvolutionalRecurrentNeuralNetwork(GCRNN).GCRNNmodelsboththespatialandthetemporaldependenceinthetracnetwork.Specically,weresorttorecurrentneuralnetworktocapturethenon-lineardynamics,andmodifytheGatedRecurrentUnittoincorporatetheunderlyingsensornet-workstructure.Thisisdonethroughtransformationofinputsequencethroughagraphconvolutionalkernel.Figure1:LocalspatialdependencyforToaddresstheerrorpropagationissueinlong-termfore-singlesensorlearnedfromweightedav-castingtask,wefurtherintegratetheencoder-decodererage.Largerweightsindicatehigherframeworkandscheduledsamplingtechnique3.Whencorrelation.evaluatedonthereal-worldtracdata,GCRNNconsistentlyoutperformsstate-of-the-arttracforecastingbaselinesbyalargemargin.Ourcontributionscanbesummarizedasfollows:Weinvestigatedtracforecasting,animportantmultivariatespatiotemporalforecastingtask,andidentieditsuniquespatiotemporaldependencystructure.Weproposedgraphconvolutionrecurrentneuralnetworkasawholisticframeworktoecientlycapturebothspatialandtemporalstructure.Theproposedapproachachievesthebestreportedresultsonreal-worldtracforecastingandobtainedsignicantimprovementoverstate-of-the-artmethods.2RelatedWorkTracforecastingisaclassicproblemintransportationandoperationalresearchwhicharelargelybasedonqueuingtheoryandsimulations9.Data-drivenapproachesfortracforecastinghavereceivedconsiderableattention,detailscanbefoundinarecentsurveypaper25andthereferencestherein.However,existingmachinelearningmodelseitherimposestrongstationaryassumptionsofthedata(e.g.,auto-regressivemodel)orfailtoaccountforhighlynon-lineartemporaldependency(e.g.,latentspacemodel27,8).Recently,deeplearningmodelsdelivernewpromisefortimeseriesforecastingproblem.Forexample,in28,theauthorsstudyunivaritetimeseriesforecastingusingdeepLSTMnetwork.In10,theauthorsproposeaprobabilisticdeepauto-regressiverecurrentframeworktoforecastinventorytimeseriesacrossdierentdomains.Theforecastingproblemwearefacinghereisspatiallycorrelatedtimeseries,whichrequirescarefulmodelingofbothspatialandtemporaldependency.Intermsofgeneralsequencemodeling,RecurrentNeuralNetworks(RNNs)havebecomethestate-of-the-artchoice,leadingtosuccessfulapplicationsinlanguagemodeling2,videogeneration23,speechrecognition17andweathernowcasting26.However,mostexisting2GraphConvolutionalGraphConvolutionalRecurrentLayerRecurrentLayerGraphConvolutionalGraphConvolutionalRecurrentLayerRecurrentLayerInputGraphSignalsPredictions.TimeDelay=1EncoderCopyStatesDecoderFigure2:SystemarchitectureforGraphConvolutionalRecurrentNeuralNetworkdesignedforspatiotemporaltracforecasting.deepsequencemodelsdealswitheitherdiscretetimesequenceorsequencesthatareevenlydistributedoveraregulargrid.Forinstance,convolutionalLSTMnetwork26capturesthespatiotemporalstructureamongpixelsbyapplyingaconvolutionallterovereachframeofthevideostream.Thelanguagesequencesareoftenencodedasdiscretetimeseries.Onthecontrary,timeseriesfromsensornetworksintracforecastingarecontinuoustimesequencesdistributedoveragraph.Closelyrelatedtoourworkisthedeeplearningmodelsfornon-Euclideanstructureddata.Forexample,in20,theauthorsproposeGraphNeuralNetworks(GNN)modelinthevertexspace,whichlearnsnoderepresentationsforthegraph.Lietal.13extendsGNNforsequencemodeling.TheresultingGatedGraphSequenceNeuralnetworkachievesthestate-of-the-artperformanceforprogramverication.Goingfromvertexdomaintospectraldomain,spectralgraphconvolutionalneuralnetworks(GCN)arerstintroducedin4,whichbridgesthespectralgraphtheoryanddeepneuralnetworks.In7,theauthorsfurtherimproveGCNwithfastlocalizedconvolutionslters.OurmodelextendsGCNtomodelmultivariatetimeseriesdistributedonanetwork.OurmodelcoincideswitharecentworkonsequentialgeneralizationofGCN21,however,wefocusoncontinuoustimepredictionandlong-termforecastingbyincorporatingencoder-decoderarchitecture24andscheduledsampling3techniques.3MethodologyWerstformalizethelearningproblemoftracforecastingandidentifyuniquespatiotemporaldependencystructures.Wethenproposeavariationofthedeeprecurrentneuralnetworkmodel.Givenaseriesofroadnetworksnapshots,ourmodeladdressesthreetechnicaldiculties:(1)localizedspatialdependency,(2)temporaldynamicsingraphs,and(3)long-termforecasting.3.1TracForecastingProblemThegoaloftracforecastingistopredictthefuturetracspeedbasedonpreviouslyobservedtracow.Thetracowismeasuredbynspatiotemporalcorrelatedsensorsontheroadnetwork.Thepair-wiserelationshipbetweenthosesensorscanbemodeledasaweightedgraphG=(V,E,A),whereVisanitesetof|V|=nvertices,whileEisasetofedgesandARnnis3Htimesteps,i.e.,Xt+1,Xt+H,whereHistheforecastinghorizon.aweightedadjacencymatrixrepresentingtheconnectivitybetweensensors.Thus,anobservationoftracspeedsatatimetcanbeviewedasagraphsignal,Xt:VRdx,wheredxisthedimensionofsignalineachnode.Thetracforecastingproblemcanbeformulatedasfollows:givensensorgraphGandhistoricaltracmeasurementsofsensors,inferthemostlikelytracmeasurementsinthenextXt+H,Xt+1=argmaxlogP(Xt+H,Xt+1|Xt,XtK+1;G,)Xt+H,Xt+1Supposethateachtimestepis5minutesandHis12,theoutputsofthemodelwillbethetracmeasurementofevery5minutesforallthesensorsinthenexthour.Notethattheaforementionedtracforecastingproblemisdierentfromthesingle-steptimeseriesforecastingproblem.ThepredictiontargetofourproblemisasequenceofmultivariatetimeseriesdistributedoveragraphGwhichcontainsbothspatialandtemporalstructures.Moreover,thepredictionproblemdenedin26canbeconsideredasaspecialcaseofthisproblemwhereGisaregulargrid.3.2SpatialDependencyModelingTractimeseriesfromroadnetworksensorsdemonstratestrongspatialdependency.Itismainlydueto(1)networkconnectivity:highwaynetworksusuallyhavesensorsinstalledevery1-2miles,andtracowofadjacentsensorsarehighlycorrelated;(2)owconservation:thenumberofvehiclesenteringandexitingtheroadsareapproximatelythesame.Unfortunately,recurrentneuralnetworks(RNNs)donotexplicitlymodelsuchspatialdependency.Inthiswork,weaugmentRNNsbyconsideringspatialcorrelationsamongmultivariatetimeseries.GraphAttentionMechanismInRNNs,theactivationisaweightedcombinationofallthehistoricalobservationsandhiddenstates,whilethespatialdependencyoftracisratherlocalized.Inordertoaccountforsuchlocaldependency,wegeneralizetheattentionmechanismfromsequencemodelingtospatialmodeling.Inparticular,weallowthemodeltolearntofocusoncloseneighborhoodsinsteadoftheentirenetwork.Thisisachievedbyrepresentingthehiddenstateofasensorusingacombinationofthehiddenstatesfromnearbysensorsweightedbyattention.Theattentionmechanismisdenedas:fatt(hi,hj)=hiWahj,aij=exp(fatt(hi,hj)knb(i,K)exp(fatt(hi,hk),gi=jnb(i,K)aijhj,(1)wherehidenotesthehiddenstatesofsensoriwhichisextractedusingaRNNsharedacrossallthenodes.nb(i,K)returnsthesetofneighborsthatarewithinK-hopfromnodei,andgirepresentstheaggregatedhiddenstatefornodeithatincorporatesinformationfromneighborhoodnodes.Then,theforecastingtaskofnodeiisimplementedusingafullyconnectedfeedforwardnetworkwithgiastheinput.GraphLaplacianTransformationGraphattentionmechanismenablesexplicitnetworkstructuremodeling,butinpracticeitonlyleadstomarginalperformanceimprovement.Thisispartlybecauseofthedicultyintrainingsuchmodelasitistimeconsumingtocomputepair-wiseattentionforlargenumberofnodes.Anotherreasonisthatgraphattentiononlymodelsthetopologicaldependencyinthevertexdomain,andyetitfailstocapturethe“conservationofow”propertyintrac.WeresolvethisissuebytransformingthetractimeseriesfromvertexdomainintothespectraldomainusinggraphLaplacian.4Figure3:Visualizationofeigen-functions.(a)showssensorlocationsonthemap.(b)and(c)correspondtoeigen-functionswithsmalleigenvalue(lowfrequencyandsmooth)while(d)and(e)correspondtooneswithlargeeigenvalues(highfrequencyandnon-smooth).GraphLaplacianisadiscreteversionofLaplacianoperator,whichcharacterizesthecon-nectivityofthegraph.ApplyingLaplacianoperatory=Lxtothesignalrepresentsone-stepdiusionofthesignalonthegraph.WearguethatitisnaturaltouseGraphLaplacianoperatorfortracforecastingproblems.Ifwemodelthechangeoftracowasxti(t)=jAij(xixj),wehavexti(t)=cLix,whereAijistheelementoftheadjacencymatrixofthegraph,ListhegraphLaplacianandcisaconstant.Thissharessimilarformastheheatequation,whichisgivenbythelaw“conservationofenergyinphysics.Inimageprocessing,thistransformationisknownasgraphconvolutionalkernel,denotedasg.Tracforecastingproblemprovidesanalternativemotivationofperformingsuchtransformation.ToobtaintheLaplacianmatrix,weconstructtheadjacencymatrixbasedonroadnetworkdistancewithathresholdedGaussiankernel22.Figure3visualizestheeigen-functionsofthenormalizedLaplacianmatrixforpartoftheroadnetworkinLosAngeles.Smalleigen-functionsrepresentsmoothspatialdependencywhilelargeonesdenotehighoscillation.WecanmakesomeinterestingobservationsinFigure3,whichcouldhelpexplainthespatialdependencycapturedbyLaplacian.Forexample,in(c)NearUniversalStudiosHollywood,atthecrossingofhighway101,134and170(d)NearRoseBowlStadium,atthecrossingofhighway2and134.Todealwithspatialdependencyatdierentresolutions,wecomputeaweightedsumofkthpowerofLaplacianasthespectraltransformation.ThisisbasedonthefactthatkthpowerofLaplacianissupportedbyexactlyk-hopneighbors22,representingthespreadoftracowatdierentscale.ComputingthekthpowerLaplacianmatrixcanbecomputationallyexpensive,soweapplyChebyshevpolynomialexpansion7forecientapproximation.K1K1K1y=gw(L)x=wkLx=UwkUxwkTk()xtttktkt(2)k=0k=0k=0ofChebyshevcoecientswhileTk()RnnistheChebyshevpolynomialoforderkevaluatedat=2/maxI.ThisapproximationreducesthelteringcomputationalcostfromO(|V|2)wherethegw(L)isthelearnedlterbasedonLaplacianmatrix.parameterwRKisavectortoO(K|E|).3.3TemporalDynamicsModelingWemodelthetemporaldynamicsintheframeworkofrecurrentneuralnetworks.OneofthevariantsofRNNistheGatedRecurrentUnits(GRU)6whichhasasimplerstructureandcompetitiveperformancecomparingwithLSTM.GatedGraphConvolutionWeincorporatespatialdependencyintoGRUbyreplacingthematrixmultiplicationwiththegraphconvolutionGdenedinEquation2.Thisgraphconvolu-5Figure4:Visualizationof24hoursroadnetworktractimeseriesevolutioninspectraldomainwithLaplaciantransformedinput(toprow)andvertexdomainwithrawinput(bottomrow).Spectraldomainenjoysbettersparsity.Theskewnessofthedistributionofthetransformedinputreectsthetraccongestioncondition.tionaloperationisappliedtobothinputsandhiddenstatestoobtainaGraphConvolutionalGatedRecurrentUnit(GCGRU).rt=(WrGxt+UrGht1+br)ut=(WuGxt+UuGht1+bu)ct=tanh(WcGxt+UcG(rtht1)+bc)ht=utht1+(1ut)ctWestackGRUandunrolltherecurrenceforaxednumberofstepsTanduseback-propagationthroughtimeinordertocomputegradients.Figure4showstheroadnetworktracevolutionin24hours,goingthroughmorningrushhourandafternoonrushhour.Wecanseethatinspectraldomain,thetracspeedtimeseriesenjoysbettersparsitythaninthevertexdomain.Thedistributionofthetransformedinputreectthetraccongestioncondition.Withheavycongestioninrushhour,thespectraldistributionofthetimeseriesbecomemoreheavy-tailed.Long-TermForecastingInlong-termforecasting,simplytrainingthemodelforonestepaheadprediction,andthenback-feedingthepredictionsattesttimeispronetoerrorpropagation.Theforecastingerrorinearlierstepscouldbequicklyampliedoverlong-timespan.Wedrawinspirationfromtheencoder-decoderarchitecture24aswellasscheduledsam
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負責。
- 6. 下載文件中如有侵權(quán)或不適當內(nèi)容,請與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準確性、安全性和完整性, 同時也不承擔用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。
最新文檔
- 2025個性化家具定制銷售合同
- 2025區(qū)域銷售代理合同范本
- 2025年的經(jīng)濟適用房買賣合同范本
- 《比較發(fā)展模式》課件
- 2025雇傭人員勞動合同范本
- 《健康生活與疾病預(yù)防》課件
- 超靜定結(jié)構(gòu)概述超靜定次數(shù)的確定去掉多余約束法
- 《近代藝術(shù)》課件
- 《青少年文學(xué)鑒賞指導(dǎo)》課件
- 激光去紋身的臨床護理
- 一年級語文下冊課件-21 小壁虎借尾巴24-部編版(15張PPT)
- 患者隨訪率低原因分析以及對策
- DB32∕T 2349-2013 楊樹一元立木材積表
- 首屆上??萍计诳庉嫾寄艽筚愒囶}
- 隧道二襯、仰拱施工方案
- Q∕GDW 12106.4-2021 物聯(lián)管理平臺技術(shù)和功能規(guī)范 第4部分:邊緣物聯(lián)代理與物聯(lián)管理平臺交互協(xié)議規(guī)范
- 中國癲癇診療指南-癲癇持續(xù)狀態(tài)課件
- 計量認證實驗室程序文件(全套)
- DGC型瓦斯含量直接測定裝置使用說明書
- 普通座式焊接變位機工裝設(shè)計全套圖紙
- 客戶報備申請表(共3頁)
評論
0/150
提交評論