XJ-AIOps

Materials about Artificial Intelligence for IT Operations (AIOps).

Researchers
Industrial Materials
Academic Materials
- Talks
- Workshops
Papers
Competitions
Datasets
- CUHK 香港中文大学
- Microsoft 微软
- Tsinghua 清华 CCF 国际AIOps大赛
- Google 谷歌
- Backblaze 云备份和存储供应商
- Alibaba-智能运维大赛
- AIOps2022通信网络智能运维大赛数据集
- GAIA-DataSet
- 华为网络运维数据集
Tools, Models and Systems
Community
Internet articles
Others
- Courses
Reference

一.Researchers

China (& HK SAR)
Michael R. Lyu, CUHK	Dongmei Zhang, Microsoft	Pengfei Chen, SYSU	Dan Pei, Tsinghua
Pengfei Chen, SYSU	Xin Peng, Fudan	Qingwei Lin, Microsoft

USA
Ryan Huang, JHU	Yingnong Dang, Microsoft	Christina Delimitrou, MIT EECS
Europe
Odej Kao, TU Berlin
Australia
Hongyu Zhang, UON

Dan Pei-Tsinghua(裴丹-清华)

Pengfei Chen(陈鹏飞-中山)

Qingwei Lin(林庆维-微软)

Pinjia He(贺品嘉-港中文)

Shenglin Zhang(张圣林-南开)

Hanzhang Wang(王含璋-eBay)

二. Industrial Materials

White Papers

[VMware] Proactive Incident and Problem Management
[GREATOPS 高效运维社区] 《企业级 AIOps 实施建议》白皮书
[Awesome Open Source] Aiops Handbook

Blogs & Tutorials & Magazines

[Tsinghua University] 清华裴丹：AIOps落地的15条原则
[Tsinghua University] 清华裴丹：AIOps效果落地最后一公里
[Alibaba Cloud] 基于大数据的智能网络分析-齐天
[Tsinghua University] 清华裴丹：AIOps效果落地最后一公里
[Moogsoft] What is AIOps?
[Microsoft] Advancing Azure service quality with artificial intelligence: AIOps

Companies

Datadog: A monitoring and security platform for cloud applications
必示 bizseer
听云 TINGYUN: 端到端的全平台应用性能管理系统
Loom Systems

三. Academic Materials

Talks

[Michael R. Lyu] Reliability-Driven AIOps for Cloud Resilience (Keynote talk at ICSE '21)

Workshops

四. Papers

Survey & Empirical Study

[arXiv '21] Experience Report: Deep Learning-based System Log Analysis for Anomaly Detection
[CSUR '21] A Survey on Automated Log Analysis for Reliability Engineering
[ESEC/FSE '20] Towards intelligent incident management: why we need it and how we make it
[arXiv '20] A Systematic Mapping Study in AIOps
[ICSE '19] AIOps: Real-World Challenges and Research Innovations
[ISSRE '16] Experience Report: System Log Analysis for Anomaly Detection
[ASE '13] Software analytics for incident management of online services: An experience report

The benchmarks

Knowledge Graph for AIOps

[ICSE-SEIP '22] Mining Root Cause Knowledge from Cloud Service Incident Investigations for AIOps
[ICSE-SEIP '21] Neural knowledge extraction from cloud service incidents
[arXiv '21] SoftNER: Mining Knowledge Graphs From Cloud Incidents
[APPLSCI '20] A Causality Mining and Knowledge Graph Based Method of Root Cause Diagnosis for Performance Anomaly in Cloud Applications

Microservices and Serverless

VM Analysis and Management

Deployment

五. Competitions

CCF国际AIOps挑战赛

[AIOps Challenge] A series of AIOps competitions hosted by Tsinghua University

阿里巴巴

华为

AIOps 2022 通信网络智能运维大赛

六. Datasets

CUHK 香港中文大学

[CUHK] Loghub

Microsoft 微软

[Microsoft Azure] Azure Public Dataset

Tsinghua 清华

Google 谷歌

[Google] Cluster Traces

Backblaze 云备份和存储供应商

[Backblaze] Hard Drive Dataset

Alibaba 智能运维大赛数据集

[Alibaba] SMART Dataset of PAKDD CUP 2020
[Alibaba] SSD SMART logs and failure data

Ceph Drive Telemetry Data

[Ceph Drive] kaggle

GAIA-DataSet

GAIA

七. Tools、Models and Systems

[Log Analytics] LogPAI
[AI for Cloud Operation] OpsPAI
[Outlier Detection] PyOD
[Anomaly Detection] ADTK
[Anomaly Detection] PySAD
[Online Machine Learning] River
[Online Machine Learning] scikit-multiflow
[Fault Injection] Chaos Mesh
[Fault Injection] ChaosBlade
[Container Monitoring] cAdvisor
[Performance Monitoring] Netdata
[Anomaly Detection Labeling Tool] Microsoft TagAnomaly
[Serverless App Dev. Framework] AWS Serverless Application Model (AWS SAM)
[Fudan] Train Ticket (A Benchmark Microservice System) 该项目是一个基于微服务架构的火车票预订系统，包含41个微服务。
[Weaveworks] Sock Shop (A Microservices Demo Application) 袜子店模拟了一个销售袜子的电子商务网站中面向用户的部分。它的目的是帮助演示和测试微服务和云原生技术。

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
数据集及其EDA		数据集及其EDA
比赛方案分享和分析/AIOps2022通信网络智能运维大赛		比赛方案分享和分析/AIOps2022通信网络智能运维大赛
论文代码复现及分析/InterFusion		论文代码复现及分析/InterFusion
论文阅读笔记		论文阅读笔记
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

License

muxunting/XJ_AIOps

Folders and files

Latest commit

History

Repository files navigation

XJ-AIOps

一.Researchers

Dan Pei-Tsinghua(裴丹-清华)

Pengfei Chen(陈鹏飞-中山)

Qingwei Lin(林庆维-微软)

Pinjia He(贺品嘉-港中文)

Shenglin Zhang(张圣林-南开)

Hanzhang Wang(王含璋-eBay)

二. Industrial Materials

White Papers

Blogs & Tutorials & Magazines

Companies

三. Academic Materials

Talks

Workshops

四. Papers

Survey & Empirical Study

The benchmarks

Knowledge Graph for AIOps

Microservices and Serverless

Dependency and Tracing

Anomaly and Failure Detection

Incident and Alarm Management

Node, Disk, and Storage

VM Analysis and Management

Deployment

五. Competitions

CCF国际AIOps挑战赛

阿里巴巴

华为

六. Datasets

CUHK 香港中文大学

Microsoft 微软

Tsinghua 清华

Google 谷歌

Backblaze 云备份和存储供应商

Alibaba 智能运维大赛数据集

Ceph Drive Telemetry Data

GAIA-DataSet

七. Tools、Models and Systems

八. Community

九. Internet articles

十. Others

Courses

十一. Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages