wise-monitoring

Devops Ninegravity

You are a senior full-stack engineer specializing in Python backend systems and DevOps tooling. Build a production-ready monitoring system in Python using **FastAPI** that covers five core monitoring domains: 1. **Website/Uptime Monitoring** — HTTP checks, response time measurement, status code validation, and downtime detection. 2. **Database Storage Monitoring** — Track disk usage, collection/table sizes, document/row counts, and storage growth trends; alert when thresholds are exceeded. 3. **Database Outage Detection** — Probe database connectivity and query health; detect and report connection failures, timeouts, and replication issues. 4. **Server Monitoring & Utilization** — Collect CPU, memory, disk I/O, and network metrics via `psutil`; alert when utilization exceeds defined thresholds. ## Target Databases The system must support monitoring for all of the following databases: - **MongoDB** — use `motor` (async) for connectivity checks, storage metrics, collection sizes, document counts, and replica set health. - **PostgreSQL** — use `asyncpg` for connectivity checks, table sizes, row counts, and replication lag. - **MySQL / MariaDB** — use `aiomysql` for connectivity checks, table sizes, and query health. - **Redis** — use `aioredis` for connectivity checks, memory usage, key counts, and eviction metrics. Each database monitor must be implemented as a standalone module so new database types can be added without modifying existing monitors. ## Technical Stack & Requirements - **Framework:** FastAPI (Python) - **Persistence:** Store all check results, historical metrics, and alert logs in MongoDB — design schemas to support trend queries over time - **Scheduler:** Use `APScheduler` or `asyncio`-based polling for interval-driven checks; all checks must run asynchronously and never block each other - **Error isolation:** Failures in one monitor must never crash the system — implement per-monitor error handling, graceful retries, and structured logging - **Configuration:** Externalize all settings (URLs, connection strings, thresholds, polling intervals) into a `.env` file and `config.py` using `pydantic-settings` ## System Components Build the following as distinct, cleanly separated modules: - **`monitors/`** — Individual monitor implementations (`website`, `mongodb`, `postgresql`, `mysql`, `redis`, `server`) - **`scheduler/`** — Orchestrates all polling intervals and dispatches monitor runs - **`storage/`** — Data access layer for persisting and querying check results and metrics - **`alerting/`** — Threshold evaluation and notification dispatch (log-based + webhook stub) - **`api/`** — FastAPI routers exposing endpoints to: register monitors, retrieve current status, fetch historical metrics, and acknowledge alerts - **`dashboard/`** — A simple HTML/JS status dashboard served by FastAPI showing live monitor states ## Code Quality Standards - Production-level error handling and structured logging throughout (`loguru` or Python `logging`) - Clear docstrings and inline comments explaining each module's responsibility - Pydantic models for all request/response schemas and internal data structures - Basic unit tests for monitor logic using `pytest` and `pytest-asyncio` ## Deliverables Provide the complete project structure, all source files with full working code, a `requirements.txt`, a `.env.example` config file, and setup/run instructions. The system must be runnable locally with `uvicorn`, a running MongoDB instance, and optionally connected target databases.

Comments (0)

No comments yet. Be the first!

As a Frontend Developer, I want to implement a centralized theme and color system from the mock-design pages so that all UI components share a consistent visual language across the application.

AI 80%

Human 20%

High Priority

1 day

AI Credits:7

Frontend Developer

As a DevOps professional, I want the Metrics page implemented from the existing JSX design so that I can inspect historical check results and filter data by custom time ranges.

Depends on:#1

Waiting for dependencies

AI 88%

Human 12%

High Priority

1.5 days

AI Credits:6

Frontend Developer

As a DevOps professional, I want a Login page implemented from the existing JSX design so that I can securely authenticate and access the monitoring platform.

Depends on:#1

Waiting for dependencies

AI 90%

Human 10%

High Priority

1 day

AI Credits:5

Frontend Developer

As an admin, I want the Alerts page implemented from the existing JSX design so that I can view active alerts and acknowledge them directly from the UI.

Depends on:#1

Waiting for dependencies

AI 87%

Human 13%

High Priority

1.5 days

AI Credits:6

Frontend Developer

As a DevOps professional, I want the Monitors page implemented from the existing JSX design so that I can register, view, and manage all website, database, and server monitors in one place.

Depends on:#1

Waiting for dependencies

AI 88%

Human 12%

High Priority

1.5 days

AI Credits:6

Frontend Developer

As an admin, I want the Settings page implemented from the existing JSX design so that I can configure thresholds, polling intervals, and database connection settings from the UI.

Depends on:#1

Waiting for dependencies

AI 85%

Human 15%

Medium Priority

1.5 days

AI Credits:6

Frontend Developer

As a DevOps professional, I want the Dashboard page implemented with the Galaxy Map and Live Status views from the existing JSX design so that I can get an at-a-glance overview of all monitored systems, acknowledge alerts, and inspect individual monitors.

Depends on:#1

Waiting for dependencies

AI 90%

Human 10%

High Priority

2 days

AI Credits:8

Frontend Developer

As an admin, I want the Logs page implemented from the existing JSX design so that I can review structured alert logs and filter them by severity, monitor, and time range for incident analysis.

Depends on:#1

Waiting for dependencies

AI 87%

Human 13%

Medium Priority

1 day

AI Credits:5

Frontend Developer

As a Backend Developer, I want a Monitors Registration API so that the frontend can register, list, update, and delete website, database, and server monitor configurations stored in MongoDB.

Depends on:#4

Waiting for dependencies

AI 70%

Human 30%

High Priority

2 days

AI Credits:7

Backend Developer

As an admin, I want a Settings and Configuration API backed by pydantic-settings so that threshold values, polling intervals, and database connection settings configured in the Settings page are persisted and applied at runtime.

Depends on:#8

Waiting for dependencies

AI 68%

Human 32%

Medium Priority

1.5 days

AI Credits:6

Backend Developer

As a Backend Developer, I want a Server Monitor Engine using psutil so that the system can asynchronously collect CPU, memory, disk I/O, and network utilization metrics from registered server targets.

Depends on:#9

Waiting for dependencies

AI 65%

Human 35%

High Priority

2 days

AI Credits:7

Backend Developer

As a Backend Developer, I want a Website Monitor Engine so that the system can perform async HTTP/HTTPS checks, measure response time, validate status codes, and detect downtime for registered website monitors.

Depends on:#9

Waiting for dependencies

AI 65%

Human 35%

High Priority

2 days

AI Credits:7

Backend Developer

As a Backend Developer, I want a Database Monitor Engine with standalone modules for MongoDB (motor), PostgreSQL (asyncpg), MySQL (aiomysql), and Redis (aioredis) so that the system can check connectivity, storage metrics, and query health for each database type.

Depends on:#9

Waiting for dependencies

AI 60%

Human 40%

High Priority

3 days

AI Credits:9

Backend Developer

As a Backend Developer, I want an APScheduler-based Scheduler Service so that all website, database, and server monitor polling jobs run asynchronously at their configured intervals without blocking each other.

Depends on:#16#11#12#10

Waiting for dependencies

AI 60%

Human 40%

High Priority

2 days

AI Credits:7

Backend Developer

As a DevOps professional, I want a Metrics and Historical Data API so that all check results are stored as time-series records in MongoDB and can be queried with time-range and granularity filters to power the Metrics page charts.

Depends on:#10#12#5#11

Waiting for dependencies

AI 65%

Human 35%

High Priority

2.5 days

AI Credits:8

Backend Developer

As a DevOps professional, I want real-time WebSocket or polling support integrated into the Dashboard so that the Galaxy Map and Live Status views reflect the latest monitor states without a manual page refresh.

Depends on:#17#13#3

Waiting for dependencies

AI 62%

Human 38%

High Priority

2.5 days

AI Credits:8

Backend Developer

As an admin, I want an Alerts API so that the system evaluates metric thresholds, creates and persists alert records in MongoDB, and exposes CRUD plus an acknowledge endpoint consumed by the Alerts page.

Depends on:#6#13

Waiting for dependencies

AI 65%

Human 35%

High Priority

2 days

AI Credits:7

Backend Developer

As an admin, I want a Logs API so that structured alert and system logs are stored via loguru and can be retrieved with filtering by severity, monitor name, and time range to power the Logs page.

Depends on:#7#14

Waiting for dependencies

AI 68%

Human 32%

Medium Priority

1.5 days

AI Credits:6

Backend Developer

As a Backend Developer, I want an Alerting Engine so that triggered threshold breaches dispatch notifications via webhook stubs and structured log outputs, ensuring admins are promptly notified of critical events.

Depends on:#17#14

Waiting for dependencies

AI 60%

Human 40%

Medium Priority

2 days

AI Credits:7

Backend Developer

wise-monitoring

Comments (0)

Project Tasks19

Implement Theme Color System

Build Metrics Page

Build Login Page

Build Alerts Page

Build Monitors Page

Build Settings Page

Build Dashboard Page

Build Logs Page

Implement Monitors Registration API

Implement Settings Config API

Implement Server Monitor Engine

Implement Website Monitor Engine

Implement Database Monitor Engine

Implement Scheduler Service

Implement Metrics Historical Data API

Implement Real-Time Dashboard Updates

Implement Alerts API

Implement Logs API

Implement Alerting Engine