




IceCube Project Data Storage Requirements, 2008 to 2013
Introduction
Since the original design estimates for the IceCube project, which are captured in the “Preliminary Design Document” (PDD), the requirements for data storage have evolved dramatically. For instance, the present volume of simulation data held on disk, to simulate IC22 and prior configurations, is larger than the total estimate for 15 years of experimental and simulation data in the PDD.
This increase in data volumes can be traced to numerous sources; however, it is difficult to quantify the contribution of each to the resulting total increase. Major increases can be attributed to the DAQ, with the average event size being significantly higher than originally estimated. This is then immediately compounded by the change in emphasis towards a low trigger threshold.
It was also expected that the filtering system would be more advanced than it presently is. While the filtering system is certainly reaching a high level of maturity, it is still at the point where it has been necessary to re-filter large volumes (many tens of TB) of raw data, and large volumes of unfiltered simulation data are being held online. The expectations of being able to put this software quickly into production may have been too high.
During 2007 it became obvious that the storage system would not be able to meet the ongoing needs of the experiment. Many measures were attempted to minimize the impact of growing requirements, such as running at very high utilization rates and stretching infrastructure to its limits. This has predictably resulted in increased fragility and associated downtime. It has also resulted in the loss of the ability to respond quickly to expansion needs, with not only the disk being fully utilized but also infrastructure such as servers and SAN switches being at capacity.
In response to this situation a meeting was held in November 2007, followed by a more detailed follow-up in January 2008. At the January meeting, requirements for each presently identified area were captured, and a new analysis storage area was proposed. This document gives an overview of the IceCube storage types and areas, and presents the requirements as captured at the January meeting.
Overview
The total data storage requirements for IceCube are large but not massive. Originally, when the requirements were more modest, it was expected that it would be possible to store all data on online disk. However, it has reached the point that this is not practical within budget considerations. Large online disk volumes also result in correspondingly high Data Center space, electrical, and cooling needs. In response, it was decided to introduce a tape-based file system for long-term storage of potentially large volumes of data. This system was originally designed to have limited functionality, primarily aimed at restoring South Pole data tapes. It is proposed that this system be expanded to meet other storage needs.
Presently the IceCube Data Center has approximately 250 TB (usable) of online disk, mostly on a software-based SAN (Ibrix) that is not tied to a specific hardware vendor, and to a lesser extent on direct attached storage. Recently an HSM tape-based file system was added with an initial licensed capacity of 80 TB. In addition to user-accessible storage there is a backup system which does nightly incremental backups and periodic offsite backups. The backup system uses Atempo Time Navigator software and a Spectra Logic T950 tape library.
The online storage is presently divided into 3 areas: experimental data, simulation data, and user data. It has been proposed that an additional storage area for analysis be added. The present status of online storage is as follows.
/data/exp: 141 TB, 93% used
/data/sim: 69 TB, 89% used
/net/user (total): 33 TB, broken down into 24 TB (UW) with 13 TB used, and 9 TB (non-UW) with 7.1 TB used
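As a rough illustration of how little headroom these utilization figures leave, the free space in each area can be estimated directly from the quoted numbers. This is only a sketch: the variable and function names are hypothetical, and the percentages are taken at face value as quoted above.

```python
# Capacity in TB and used fraction, as quoted in the status list above.
areas = {
    "/data/exp": (141.0, 0.93),
    "/data/sim": (69.0, 0.89),
}


def free_tb(capacity_tb, used_fraction):
    """Return the unused capacity in TB for one storage area."""
    return capacity_tb * (1.0 - used_fraction)


# Remaining space per area, rounded to 0.1 TB.
headroom = {
    name: round(free_tb(capacity, used), 1)
    for name, (capacity, used) in areas.items()
}

# The user area is quoted as absolute usage (13 TB UW + 7.1 TB non-UW of 33 TB).
user_used_fraction = round((13.0 + 7.1) / 33.0, 2)

for name, tb in headroom.items():
    print(f"{name}: about {tb} TB free")
print(f"/net/user: about {user_used_fraction:.0%} used")
```

Under these figures the experimental and simulation areas each have less than 10 TB free.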
The user storage area is a relatively small storage area to which every collaborator has access. Each user is currently limited to 100 GB, unless their institution provides funding to increase this limit, as is the case with UW. It is a practical necessity to have a storage area closely coupled to the Data Warehouse for each active user. This area is essentially an extension of the user's home directory, while being physically separate to avoid any performance issues associated with data storage that could impact routine operations dependent on home directories.
The IceCube storage types, and the associated responsible people, are as follows. This includes the proposed area of analysis storage.
Pole DAQ: Kael Hanson
Online Filtering: Erik Blaufuss, chair of TFT Board
Offline Filtering: Martin Merck
Simulation: Paolo Desiati
Analysis: Gary Hill, Analysis Coordinator
The requirements presented in this document mostly originated at the meeting on January 28th, 2008 in Madison, which produced the document “Notes of the Storage Requirements Meeting”. This document is accompanied by a detailed IceCube Data Storage Requirements spreadsheet.
Requirements
South Pole DAQ and Online Filtering
Kael Hanson and Erik Blaufuss
It is proposed that for the coming year, IC40, the current archiving arrangement continue, with 3 key types of DAQ data output being maintained. These are:
Raw data stream
Data transferred over the satellite, predominantly filtered data
Filtered data not transferred over the satellite
For IC40 the expected sizes of these data streams are: raw, 500 GB per day (610 GB uncompressed); satellite, 30 GB per day; and filtered but not sent over the satellite, 15 GB per day.
It is hoped that during the IC40 period online filtering will reach a level of maturity such that archiving of the raw data could be discontinued. In future years it is proposed that the entire raw data stream not be archived, and that the raw data be taped only for a limited period of less than 2 months after the addition of new strings, during the settling-in period of the new filter.
The expected data volumes for IC60 are 750 GB per day of raw data and 80 GB per day of filtered data. For IC80 the expected data volumes are 1 TB per day of raw data and 135 GB per day of filtered data. The volume of filtered data to be transferred over the satellite, and the volume to be taped for later physical transport, will depend on upgrades to the TDRS transfer system and conflicts with other South Pole users. In 2008/2009 it is expected that operations will move to using TDRS F3, allowing IceCube to transfer 60 GB per day, resulting in 20 GB per day of filtered data being taped.
All data volumes quoted are for compressed data.
The accompanying spreadsheet shows both scenarios: taping, and not taping, raw data. It also assumes a tape technology upgrade in 2010. More details about South Pole taping are contained in the document “Proposal for Archiving of Data at South Pole”, February 2008.
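The daily figures above can be turned into yearly volumes and a satellite/tape split with a few lines of arithmetic. The sketch below is illustrative only and does not reproduce the accompanying spreadsheet: it assumes 365 days of data taking per year and decimal units (1 TB = 1000 GB), and all names are hypothetical. The yearly raw figures apply only to the scenario in which raw data is taped for the full year.

```python
GB_PER_TB = 1000     # assumption: decimal units
DAYS_PER_YEAR = 365  # assumption: continuous data taking

# Compressed daily volumes in GB, as quoted in the text.
RAW_GB_PER_DAY = {"IC40": 500, "IC60": 750, "IC80": 1000}
FILTERED_GB_PER_DAY = {"IC40": 30 + 15, "IC60": 80, "IC80": 135}


def annual_tb(gb_per_day):
    """Convert a daily volume in GB to a yearly volume in TB."""
    return gb_per_day * DAYS_PER_YEAR / GB_PER_TB


def taped_filtered_gb(filtered_gb_per_day, satellite_gb_per_day):
    """Filtered data left over for tape once the satellite allocation is used."""
    return max(filtered_gb_per_day - satellite_gb_per_day, 0)


for config, raw in RAW_GB_PER_DAY.items():
    print(f"{config}: {annual_tb(raw)} TB of raw data per year if fully taped")

# IC40 with 30 GB/day over the satellite, IC60 with 60 GB/day over TDRS F3.
ic40_taped = taped_filtered_gb(FILTERED_GB_PER_DAY["IC40"], 30)
ic60_taped = taped_filtered_gb(FILTERED_GB_PER_DAY["IC60"], 60)
print(f"IC40 filtered data taped: {ic40_taped} GB/day")
print(f"IC60 filtered data taped: {ic60_taped} GB/day")
```

The satellite allocation for IC80 is not stated in the text, so no tape split is computed for that configuration.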
Simulation Data
Paolo Desiati
The data storage requirements of simulation depend on numerous factors, which are summarized in the attached spreadsheet. It was determined that the immediate needs of simulation are an additional 10 TB to complete IC40 simulation, an additional 50 TB mid-year, and an additional 30 TB in the fall to do IC60 simulation.
Next year, for IC80 simulation, it is expected that 60 TB of data could be moved to another storage medium such as the HSM, and that an additional 60 TB would be required. This would be the steady state for an 80-string detector.
Experimental Data
Martin Merck
Experimental data is defined as any data transferred from the South Pole, and the output of “Primary Data Processing” (a.k.a. Offline Filtering). The distinction between “Primary Data Processing” and Analysis Data is based more on the project's well-defined processes in place for the two areas than on anything else.
The data life cycle of filtered data in the offline filtering system is that data undergoes 3 processes starting from the original output of the online filtering. The output of the online filtering system is referred to as PFFilt. The offline filtering is done in the northern hemisphere, and the outputs of the successive filtering processes are Level 0, Level 1, and Level 2. The size of the Level 2 data is approximately 50% larger than PFFilt, and the intermediate levels are somewhere in between.
During the IC40 processing it is expected that the original PFFilt files and all 3 output levels will be held on disk simultaneously. With 30 GB per day of PFFilt data arriving over the satellite, the total requirement of offline filtering will be approximately 105 GB per day. Thus to support offline filtering for IC40 approximately 50 TB of storage will be required, which includes a small overhead margin.
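The step from 105 GB per day to approximately 50 TB can be checked with a short calculation. The sketch below assumes a full 365 days of processing and decimal units (1 TB = 1000 GB), neither of which is stated in the text, and treats whatever remains of the 50 TB as the overhead margin.

```python
GB_PER_TB = 1000          # assumption: decimal units
DAYS_PER_YEAR = 365       # assumption: a full year of IC40 data

OFFLINE_GB_PER_DAY = 105  # PFFilt plus Level 0, 1 and 2, as quoted in the text
PLANNED_TB = 50           # storage proposed for IC40 offline filtering

# Disk needed to hold one year of PFFilt and all three output levels.
yearly_tb = OFFLINE_GB_PER_DAY * DAYS_PER_YEAR / GB_PER_TB

# What is left of the proposed allocation is the overhead margin.
margin_tb = PLANNED_TB - yearly_tb
margin_fraction = margin_tb / yearly_tb

print(f"One year of offline filtering data: {yearly_tb:.1f} TB")
print(f"Margin within {PLANNED_TB} TB: {margin_tb:.1f} TB ({margin_fraction:.0%})")
```

Under these assumptions one year of data occupies about 38 TB, leaving roughly 12 TB of the proposed 50 TB as margin.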
At the start of the IC60 year the IC40 non-satellite filtered data will arrive from the South Pole on tapes. This data will need processing concurrently as the IC60 offline filtering starts. At this point the output of at least 2 of the IC40 processing levels could be deleted, providing the space for the processing of the extra IC40 data. The HSM storage system would also be utilized in restoring and storing the new South Pole data. However, at this stage additional online storage needs to be added for the output of IC60 processing. It is anticipated that at this stage the filtering will be mature enough that the output of all the processing levels will not have to be saved simultaneously, and that an additional 60 TB of storage would be adequate. For IC80 it is anticipated that an additional 100 TB of storage would again be required. At this point steady state is reached, and an additional 100 TB each year would be required unless older data starts to be moved to different storage areas such as the HSM.
Analysis Data
Martin Merck and Gary Hill
It is proposed that a new storage area be added for organized analysis within the collaboration. The relevant analysis areas would cover analysis beyond Level 2 of offline filtering, and would be roughly based on the analysis working groups. As such, it is proposed that the Analysis Coordinator be responsible for oversight of the usage of this storage, and for future definition of requirements in this area.
For planning purposes it is proposed that the capacity of this area expand at the same rate as offline filtering. Thus it would start with an initial 50 TB this year, expand by 60 TB in 2009, and by 100 TB each year thereafter.
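The proposed expansion schedule implies the following cumulative capacity for the analysis area. This is a sketch for illustration only: it assumes that "this year" means 2008 and runs to 2013 to match the period in the document title, and it assumes that no data is retired or moved to the HSM. The function and variable names are hypothetical.

```python
from itertools import accumulate

FIRST_YEAR = 2008  # assumption: "this year" in the text
LAST_YEAR = 2013   # end of the planning period in the document title


def yearly_additions_tb(first_year, last_year):
    """Capacity added each year in TB: 50 initially, then 60, then 100 per year."""
    additions = {}
    for year in range(first_year, last_year + 1):
        if year == first_year:
            additions[year] = 50
        elif year == first_year + 1:
            additions[year] = 60
        else:
            additions[year] = 100
    return additions


additions = yearly_additions_tb(FIRST_YEAR, LAST_YEAR)

# Running total of installed analysis capacity at the end of each year.
cumulative = dict(zip(additions, accumulate(additions.values())))

for year, total in cumulative.items():
    print(f"{year}: +{additions[year]} TB, total {total} TB")
```

On these assumptions the analysis area would reach 510 TB by 2013.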
Access to this storage would be based on the infrastructure already in place for simulation and experimental data. For instance, the lead of an analysis working group would create a directory within this storage area via the Data Warehouse web interface. This would capture information such as the purpose of the required storage and a responsible person. It would also allow for subsequent easy access via the web and query tools. Regular usage summaries would be sent to the Analysis Coordinator (and designees) for oversight of utilization.
User Data
There is a significant advantage for all users of the collaboration in having storage closely connected to the core Data Warehouse storage and CPU resources. The main purpose of this area is as a temporary storage area for data transfer or for individual analysis, and it has the desired technical consequence of keeping data out of account home directories, where it can cause significant resource issues.
Based on the present usage patterns the capacity seems to be on the small side. Presently each user has a default soft quota of 100 GB. However, compelling arguments have been made by numerous users, which have resulted in temporary increases of up to 200 GB. The capacity would be expanded to a bit over 200 TB for all users, allowing an increase in default quotas to 150 GB soft and 200 GB hard. All institutions would have the opportunity to add additional storage beyond the base, as UW has, by an additional contribution to the common fund. Otherwise it is up to individuals to work within the allocated storage.
To deal with organized analysis within the collaboration, a new storage type, analysis storage, is proposed.
HSM (Tape Based Filesystem)
An HSM is a tape-based file system. Data is stored on tapes, but a front-end disk buffer and application make the system appear as online disk. If an attempt is made to access data that is not on the buffer disk, the application automatically loads the data from tape. Thus it is online storage with a high latency. The primary purpose of the IceCube HSM system is to access the data stored at the South Pole, following the introduction of an HSM and tape library at Pole this season. However, it presently has limited ability to store data at the Data Center that is not associated with the South Pole. At present the main limitation is that the system is not simultaneously read-write.
It is proposed to upgrade the HSM system at the Data Center to provide a cheaper storage technology for infrequently accessed data, as well as providing access to the South Pole data.
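The read path described above, in which data is served from the disk buffer when present and otherwise loaded from tape, can be sketched as a small toy model. This is not the IceCube HSM software or any vendor's API. The class name, the dictionary standing in for the tape library, and the least-recently-used eviction policy are all assumptions made for illustration.

```python
from collections import OrderedDict


class HsmSketch:
    """Toy model of a tape-backed store fronted by a small disk buffer."""

    def __init__(self, tape, buffer_slots):
        self.tape = dict(tape)        # path -> contents; stands in for the tape library
        self.buffer = OrderedDict()   # disk buffer, least recently used entry first
        self.buffer_slots = buffer_slots
        self.tape_loads = 0           # counts the slow, high-latency accesses

    def read(self, path):
        """Return file contents, loading from tape only on a buffer miss."""
        if path in self.buffer:
            self.buffer.move_to_end(path)  # mark as recently used
            return self.buffer[path]
        data = self.tape[path]             # in a real system this step takes minutes
        self.tape_loads += 1
        self.buffer[path] = data
        if len(self.buffer) > self.buffer_slots:
            self.buffer.popitem(last=False)  # evict the least recently used file
        return data


hsm = HsmSketch({"a": b"1", "b": b"2", "c": b"3"}, buffer_slots=2)
hsm.read("a")  # miss: loaded from tape
hsm.read("a")  # hit: served from the disk buffer
hsm.read("b")  # miss
hsm.read("c")  # miss: buffer is full, so "a" is evicted
hsm.read("a")  # miss again: "a" has to come back from tape
print(hsm.tape_loads)
```

The caller sees an ordinary read in every case, and only the latency differs, which is why such a system is suited to infrequently accessed data.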
Archive
Archiving is a type of storage which is available to the project. However, it is important to understand what archiving is. In the context of the present IceCube infrastructure (and no change to this is planned), archiving is for data that can potentially be deleted from accessible storage, but where there is some small associated risk in doing so. Thus an archive copy is made just in case the data is needed in the future for unforeseen reasons. It is not for temporary removal of data for restoration in the future. Thus the restoration of archived data is not planned or budgeted, and any future access would need to be funded when required. There is also the issue of data migration as tape storage technologies advance. At pres