Course Outline

Designing an Open AIOps Architecture

  • Overview of key components in open AIOps pipelines
  • Data flow from ingestion to alerting
  • Tool comparison and integration strategy

Data Collection and Aggregation

  • Ingesting time-series data with Prometheus
  • Capturing logs with Logstash and Beats
  • Normalizing data for cross-source correlation

Building Observability Dashboards

  • Visualizing metrics with Grafana
  • Building Kibana dashboards for log analytics
  • Using Elasticsearch queries to extract operational insights

Anomaly Detection and Incident Prediction

  • Exporting observability data to Python pipelines
  • Training ML models for outlier detection and forecasting
  • Deploying models for live inference in the observability pipeline

Alerting and Automation with Open Tools

  • Creating Prometheus alert rules and Alertmanager routing
  • Triggering scripts or API workflows for auto-response
  • Using open-source orchestration tools (e.g., Ansible, Rundeck)

Integration and Scalability Considerations

  • Handling high-volume ingestion and long-term retention
  • Security and access control in open-source stacks
  • Scaling each layer independently: ingestion, processing, alerting

Real-World Applications and Extensions

  • Case studies: performance tuning, downtime prevention, and cost optimization
  • Extending pipelines with tracing tools or service graphs
  • Best practices for running and maintaining AIOps in production

Summary and Next Steps

Requirements

  • Experience with observability tools such as Prometheus or ELK
  • Working knowledge of Python and machine learning fundamentals
  • Understanding of IT operations and alerting workflows

Audience

  • Advanced site reliability engineers (SREs)
  • Data engineers working in operations
  • DevOps platform leads and infrastructure architects
 14 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories


Fatal error: Uncaught TypeError: _isl_get_excluded_site(): Return value must be of type ?array, none returned in /apps/hitra7/backdrop/modules/_custom/frontend/islc7/isl_common.inc:38 Stack trace: #0 /apps/hitra7/backdrop/modules/_custom/frontend/islc7/isl_common.inc(30): _isl_get_excluded_site() #1 /apps/hitra7/backdrop/modules/_custom/frontend/islc7/isl_common.inc(17): isl_get_excluded_site() #2 /apps/hitra7/backdrop/modules/_custom/frontend/islc7/islc7.module(51): get_outline_isls() #3 /apps/hitra7/backdrop/modules/_custom/frontend/islc7/islc7.module(7): islc_prepare_links() #4 /apps/hitra7/npfrontend/nptemplates/default.php(272): islc7_sites_links_array_v3() #5 /apps/hitra7/npfrontend/modules/course/course.php(143): require_once('...') #6 /apps/hitra7/npfrontend/core/routes.php(15): course_menu_callback() #7 /apps/hitra7/npfrontend/__index.php(81): require_once('...') #8 /apps/hitra7/npfrontend/index.php(15): include_once('...') #9 /apps/hitra7/index.php(66): include_once('...') #10 {main} thrown in /apps/hitra7/backdrop/modules/_custom/frontend/islc7/isl_common.inc on line 38