




版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)
文檔簡(jiǎn)介
1、阿里巴巴智能化運(yùn)維探索PART 01BackgroundPART 02What is SDDPPART 03Key AlgorithmPART 04Deployment and ResultCONTENTSFour Eras of Alibaba Database單機(jī)房 單個(gè)應(yīng)用單機(jī)MySQL同城多機(jī)房 垂直A分 商業(yè) IOE異地雙活 單元化 AliSQL異地多活 云化MLPOLARDB2005 - 2010QPE代2003 - 2004淘寶初創(chuàng)2011 - 2015DO之I2016 -新機(jī)遇新B戰(zhàn)1,000,000 DBs 100 BUsx1000Background - Do More w
2、ith LessAlibaba Database Usage Alibaba GroupFinanceRetailer ManufacturerMedia and EntertainmentInternational ClientsAlibaba Database Usage Public CloudSingles Day (11/11)Alibaba Database OverviewTools and UtilitiesEngineProprietaryOpen Source/Third-partyOperati PlatforDTSADAMData Migration &DB Trans
3、missionMigrationEvaluationDMSDBSGUI for CentralizedDB Management and DevOps,BackupEfficient and SecureServiceHDMHybrid Cloud DBManagementDBAdvisor Intelligent Diagnostics and OptimizationOLTPOLAPNoSQLGraphDBHBase+X-Pack (Multi-model Analysis)POLARDBCloud Native DB: Decoupled Compute and Storage with
4、 Hardware AccelerationAnalyticDBPB-grade Data. High Concurrency.High PerformanceData Lake Analytics Serverless Interactive Query Servicewith Presto and Spark IntegrationRedisAliSQL-MySQL/PG/ MariaDB/MS SQL ServerTSDBTime Series, Spatial Temporal DBMongoDBonCloud Database Operation PlatformmEnd-to-en
5、d Tracking and Monitoring ServiceDatabase Expert ServiceChallenges All in OneManagement at ScaleScheduling ProtectionRuntime Management Optimization Backup/Restore Security Workloads DiversitySLA-driven Workload-aware Agility100 BUs20,000 Developers Super HotspotScalability Stability Cost Cloud- Nat
6、ive DatabaseDBPaaSLarge-scale DB Lifecyle Management InfraSDDPSelf-Driving Database PlatformCommercial DatabaseOracleOpen Source DatabaseAliSQL/MySQL-20102010-2017Human LaborDBAsPOLARDB(VLDB 2018, SIGMOD 2019)AnalyticDB (VLDB 2019)2017-nowJourney to SDDP at AlibabaWhat is SDDP?Self-detectionSelf-rep
7、airingSelf-tuningSelf-securingSelf-decision-makingSelf-Driving Database Platform Database Management PlatformDatabases DiversityPOLARDB, AnalyticDB, AliSQL, MySQL, NoSQL, etc.Self-driving CapabilitiesSelf-detection, Self-decision-makingSelf-repairing, Self-tuning, Self-securingMinimum/Zero Human Lab
8、orSDDP PhilosophySDDP=+Execution (Automation)Feedback+Data (Detection)EyesMachine Learning (Decision Making)BrainHands & FeetLearning from millions of DBs to empowerDBs with availability, security and performance at scale回SDDP LevelsLevelDescriptionDecision MakingKey CapabilitiesLevel 0No Automation
9、HumanLevel 1Human AssistanceHumanStatistics Collecting, Monitoring, Alerting, Scripts/ToolsLevel 2Single Point AutomationHuman + SDDPAnomaly Detection, SQL Advisor, Capacity Planning, Health Diagnostic Framework, etc.Level 3Partial Scenarios AutomationSDDP + HumanAutomatic Repairing, Automatic SQL T
10、uning, Automatic Configuration Tuning, Auto-scaling, Automatic Resource Scheduling, Automatic Data Access Protection, etc.Level 4Full AutomationSDDPFully End-to-End Automation for All Scenarios , No Human InterventionComponents of SDDPAlibaba Self-Driving Database Platform (SDDP) provides cloud data
11、bases with automatic operation and maintenanceKey features of SDDPKnobs tuningHot/cold separationAnomaly detectionNL2SQLClouDBenchL 3ColdSelect where A =1;Workload GeneratorSelect . where B =2;Select where A = 2;Select where A =Workload Replay2;Select where B = 3;Select where A =1;lnsert .;DB Desrec
12、ommeiglnnsert ;Update ;ndUapdtaitoe n;Auto lndex/shardingHuman Languag eSQL DBMSe.g OtterTuneSlow SQLThrottlingiBTune - MotivationThe memory uses at Alibaba product environmentBuffer pool is the largest memory consumerBufferTmallD-ngd-ng Memory is bottleneck among the resourcesHemaBufferBufferBuffer
13、BufferBufferSDDP Self-Dr-v-ng Database PlatformSDDP: Self-Driving Database PlatformDBSyst/m m/tricsSQ& coll/ctionDBAdvisorControl syst/mCold hot mod/l Ind/x mod/l M/mory mod/lMod/l Pr/dictionautotuning slow sql spac/ analysisSQ& & DB M/tricSQ& & DB m/tricsParam/t/r updat/DBDBR/sourc/ sch/dulingSDDPA
14、nomaly d/t/ctionupdat/SDDPMemory buffer tuning iBTune:more than 10,000 instances, memory saving of 20TBTMallBufferTaobaoBufferBufferDingdingBufferBufferBufferAn example: iBTune (individualized BufferTuning, VLDB 2019)DBA manually uses a small number of BP sizes (10 configurations in our case) Each i
15、nstances BP size might be different as the query workload is different Manual tuning is not scalable for large cloud databases since each instance has different BP size iBTune: Individualized Buffer Tuning for Largescale Cloud DatabasesReduce memory (buffer pool) while guaranteeing SLA (response tim
16、e) CDF of individual BP sizesbefore and after the iBTune appliesiBTune - MotivationiBTune - Preliminary AttemptBuffer pool (BP) size is sensitive to miss ratio: BP size is reduced from 188G to 80G when its hit ratio is from 99.968% to 99.950%Response Timehit ratioCPU usageChallenge: Heuristic method
17、 (such as shrinking 10% each time) does not work, since we have to try many times, which makes the system unstable and is unacceptable for mission-critical applications Intuition:Calculate BP based on hit ratio (miss ratio) to avoid restarting system multiple timesConfirm whether the BP size meets t
18、he requirement of SLAtolerate miss ratio(t_miss_ratio) 曰 (tmissratio) = New BP size 2(tmissratio) = Response timePractical functionPairwise DNN21G) during holidays and workdays:Red line is the time when BP size is adjustedGreen lines show the holiday which is 7-daysPredicted RT: only 3 points exceed
19、ed which is acceptableThe IO read metric is the real IO, since all our DB instances turn on direct IO10 representative instances. The memory saving ranges from 50% to 10%, which strongly supports that a single number does not fit all. Instance 1 has a large increase in RT after the adjustment. We fi
20、nd that there is one query that consumes 99.97% of the total response time. The lookup value in WHERE condition changes for this query.Multiple instancesConclusion & uture WorkSDDP has been widely used at Alibaba. Its key algorithm “iBTune” has been deployed on 10,000 database instances with memory
21、saving : 17%Future workCache preloadBackup node needs to run SQLs to load data into cache after BP adjustmentPerform switching after preloadBuffer increaseCurrently reply on DBA Automatic increase bufferMultiple parameters tuningDBMS configure fileSDDP Big Picture Architecture, WIPDecision MakingAut
22、omatic RepairingKnowledge BaseDomain KnowledgeResource ManagementData CleaningFeature EngineeringModel TrainingEvaluationDeploymentMachine Learning PipelineSigmaAlibaba CloudKubernetesOthersDatabase OperatingPOLARDBAliSQLMySQLMongoDBPostgreSQL.PerceptionAutomation StoringMessage QueueCollectingProcessingBlinkAction PlannerAction SchedulerAction ExecutorAutomatic TuningSecurity ProtectingResource SchedulingBackup/RestoreAutomatic ScalingE
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲(chǔ)空間,僅對(duì)用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。
最新文檔
- 【正版授權(quán)】 IEC 60269-2-1:2004 EN-D Low-voltage fuses - Part 2-1: Supplementary requirements for fuses for use by authorized persons (fuses mainly for industrial application) - Sections
- 【正版授權(quán)】 IEC 60947-8:2003+AMD1:2006 CSV EN-D Low-voltage switchgear and controlgear - Part 8: Control units for built-in thermal protection (PTC) for rotating electrical machines
- 2025年平安教育培訓(xùn)方案表
- 八一建軍節(jié)活動(dòng)方案流程12025年
- 2025年中國旅游日主題活動(dòng)方案
- 大學(xué)商務(wù)禮儀課程教學(xué)
- 2025年學(xué)校數(shù)學(xué)工作方案演講稿
- 籃球課程思政教學(xué)設(shè)計(jì)
- (盛高培訓(xùn)之四)薪酬體系設(shè)計(jì)技術(shù)(資料1)
- 上海思博職業(yè)技術(shù)學(xué)院《體育與健康-健美操》2023-2024學(xué)年第一學(xué)期期末試卷
- 2025年職業(yè)指導(dǎo)師專業(yè)能力測(cè)試卷:職業(yè)指導(dǎo)服務(wù)與心理咨詢
- 學(xué)校安全管理制度匯編
- 2024年山東電力中心醫(yī)院高層次衛(wèi)技人才招聘筆試歷年參考題庫頻考點(diǎn)附帶答案
- 浙江省四校聯(lián)考2023至2024學(xué)年高一下學(xué)期3月月考化學(xué)試題附參考答案(解析)
- 左心衰竭合并肺水腫的護(hù)理查房
- 重力壩畢業(yè)設(shè)計(jì)-水電站混凝土重力壩工程設(shè)計(jì)
- 《淹溺急救》PPT課件(2022版)
- EPC項(xiàng)目管理之安全文明施工責(zé)任制度
- 第二節(jié)模糊綜合評(píng)價(jià)法
- 《雷鋒叔叔_你在哪里》說課稿55481
- CFM567反推裝置介紹
評(píng)論
0/150
提交評(píng)論