Онлайн-поток данных

Online data flow описывает синхронные потоки данных в системе — запросы от клиентов через API Gateway к микросервисам и обратно. Используется для операций, требующих немедленного ответа.

Архитектура

graph TB subgraph "Client Layer" WebApp[Web Application] MobileApp[Mobile App] CLI[CLI Tools] end subgraph "Edge Layer" LB[Load Balancer
Ingress NGINX] Gateway[API Gateway
Envoy Proxy] end subgraph "Service Layer" Identity[Identity Service
gRPC] Credential[Credential Service
gRPC] Auth[Auth Service
gRPC] Account[Account Service
gRPC] end subgraph "Data Layer" PostgresIdentity[(PostgreSQL
Identity DB)] PostgresCredential[(PostgreSQL
Credential DB)] RedisCache[(Redis
Cache & Sessions)] end WebApp -->|HTTPS/REST| LB MobileApp -->|HTTPS/REST| LB CLI -->|HTTPS/REST| LB LB --> Gateway Gateway -->|gRPC| Identity Gateway -->|gRPC| Credential Gateway -->|gRPC| Auth Gateway -->|gRPC| Account Identity --> PostgresIdentity Identity --> RedisCache Credential --> PostgresCredential Credential --> RedisCache Auth --> RedisCache Account --> PostgresIdentity style Gateway fill:#4A90E2 style PostgresIdentity fill:#51B749 style RedisCache fill:#DC382C

Типы online потоков

1. REST API (Public)

Протокол: HTTP/1.1, HTTP/2 Формат: JSON Аутентификация: JWT Bearer tokens

Примеры endpoints:

POST   /api/v1/auth/register        # Регистрация пользователя
POST   /api/v1/auth/login           # Аутентификация
POST   /api/v1/auth/refresh         # Обновление токенов
POST   /api/v1/auth/logout          # Выход из системы
GET    /api/v1/users/me             # Получение профиля
PUT    /api/v1/users/me             # Обновление профиля
POST   /api/v1/identifiers/verify   # Верификация email/phone

Request/Response Example:

POST /api/v1/auth/register HTTP/2
Host: api.aiops.io
Content-Type: application/json
User-Agent: AIOps-Web/1.0

{
  "username": "john_doe",
  "email": "john@example.com",
  "password": "SecureP@ssw0rd!"
}

HTTP/2 201 Created
Content-Type: application/json

{
  "user": {
    "id": "550e8400-e29b-41d4-a716-446655440000",
    "username": "john_doe",
    "status": "active",
    "created_at": "2024-03-09T10:30:00Z"
  },
  "tokens": {
    "access_token": "eyJhbGciOiJSUzI1NiIs...",
    "refresh_token": "eyJhbGciOiJSUzI1NiIs...",
    "expires_in": 900
  }
}

2. gRPC (Internal)

Протокол: HTTP/2 with Protocol Buffers Аутентификация: Service-to-service mutual TLS (планируется)

Service Definitions:

// identity.proto
service IdentityService {
  rpc CreateUser(CreateUserRequest) returns (CreateUserResponse);
  rpc GetUser(GetUserRequest) returns (GetUserResponse);
  rpc GetUserByUsername(GetUserByUsernameRequest) returns (GetUserResponse);
  rpc GetUserByIdentifier(GetUserByIdentifierRequest) returns (GetUserResponse);
}

// credential.proto
service CredentialService {
  rpc SetPassword(SetPasswordRequest) returns (SetPasswordResponse);
  rpc VerifyPassword(VerifyPasswordRequest) returns (VerifyPasswordResponse);
  rpc ChangePassword(ChangePasswordRequest) returns (ChangePasswordResponse);
}

Python Client Example:

import grpc
from aiops.identity.v1 import identity_pb2, identity_pb2_grpc

async with grpc.aio.insecure_channel('identity-service:50051') as channel:
    stub = identity_pb2_grpc.IdentityServiceStub(channel)

    response = await stub.GetUser(
        identity_pb2.GetUserRequest(user_id=user_id)
    )

    print(f"User: {response.user.username}")

Детальные flow patterns

Pattern 1: User Registration

sequenceDiagram participant Client participant Gateway participant Identity participant Credential participant PostgresI participant PostgresC participant Outbox participant Redis Client->>Gateway: POST /api/v1/auth/register
{username, email, password} Note over Gateway: 1. Validate request body Note over Gateway: 2. Rate limit check Gateway->>Identity: CreateUser(username) Identity->>PostgresI: BEGIN TRANSACTION Identity->>PostgresI: INSERT INTO users
(id, username, status) Identity->>Outbox: INSERT INTO outbox_events
(UserCreatedEvent) Identity->>PostgresI: COMMIT Identity-->>Gateway: User{id, username, status} Gateway->>Credential: SetPassword(user_id, password) Note over Credential: Hash password with Argon2id Credential->>PostgresC: BEGIN TRANSACTION Credential->>PostgresC: INSERT INTO password_credentials
(user_id, password_hash) Credential->>Outbox: INSERT INTO outbox_events
(PasswordSetEvent) Credential->>PostgresC: COMMIT Credential-->>Gateway: PasswordCredential{user_id} Gateway->>Identity: AttachIdentifier(user_id, email, challenge_id) Identity->>PostgresI: INSERT INTO identifiers
(user_id, type, value, verified_at) Identity-->>Gateway: Identifier{id, verified_at} Gateway->>Gateway: CreateSession + GenerateTokens Gateway->>Redis: SET session:{id} {data} EX 2592000 Gateway-->>Client: 201 Created
{user, tokens} Note over Outbox: Background worker publishes events to Kafka

Характеристики: - Latency: ~200-500ms (зависит от Argon2id hashing) - Transactions: 2 отдельные БД транзакции (Identity, Credential) - Consistency: Eventually consistent через events

sequenceDiagram participant Client participant Gateway participant Identity participant Credential participant Auth participant Redis participant PostgresI participant PostgresC Client->>Gateway: POST /api/v1/auth/login
{username, password} Gateway->>Identity: GetUserByUsername(username) Identity->>Redis: GET user:username:{username} alt Cache hit Redis-->>Identity: Cached user data else Cache miss Identity->>PostgresI: SELECT * FROM users
WHERE username = ? PostgresI-->>Identity: User record Identity->>Redis: SETEX user:username:{username}
TTL=300 end Identity-->>Gateway: User{id, status} alt User not found or disabled Gateway-->>Client: 401 Unauthorized end Gateway->>Credential: VerifyPassword(user_id, password) Note over Credential: Load password_hash from DB Credential->>PostgresC: SELECT password_hash
FROM password_credentials
WHERE user_id = ? PostgresC-->>Credential: password_hash Note over Credential: Argon2id verification
(constant-time) alt Invalid password Credential-->>Gateway: {is_valid: false} Gateway-->>Client: 401 Unauthorized end Credential-->>Gateway: {is_valid: true} Gateway->>Auth: CreateSession(user_id, metadata) Auth->>Redis: SET session:{id} {user_id, ...} EX 2592000 Auth->>Auth: GenerateTokens(user_id, session_id) Auth-->>Gateway: {access_token, refresh_token} Gateway-->>Client: 200 OK
{access_token, refresh_token}

Характеристики: - Latency: ~150-300ms (с кешем), ~300-600ms (без кеша) - Cache strategy: Username → User lookup кешируется на 5 минут - Security: Constant-time password verification

Pattern 3: Authenticated API Request

sequenceDiagram participant Client participant Gateway participant Service participant Redis participant PostgresS Client->>Gateway: "GET /api/v1/users/me
Authorization: Bearer JWT" Note over Gateway: "1. Extract JWT from header" Note over Gateway: "2. Verify signature (JWKS)" Note over Gateway: "3. Check exp, iss, aud" Note over Gateway: "4. Extract user_id from sub" Gateway->>Service: "GET /users/me
x-user-id, x-session-id" Service->>Redis: "GET user key" alt Cache hit Redis-->>Service: Cached user data else Cache miss Service->>PostgresS: SELECT user WHERE id PostgresS-->>Service: User record Service->>Redis: "SETEX user key TTL=300" end Service-->>Gateway: User data Gateway-->>Client: "200 OK + user data"

Характеристики: - Latency: ~10-50ms (с кешем), ~50-150ms (без кеша) - Cache TTL: 5 минут для user data - Validation: JWT валидируется на Gateway, сервис доверяет x-user-id

Pattern 4: Data Mutation with Cache Invalidation

sequenceDiagram participant Client participant Gateway participant Service participant Redis participant PostgresS Client->>Gateway: "PUT /api/v1/users/me
Authorization Bearer JWT
body username" Gateway->>Service: "PUT /users/id
x-user-id" Service->>PostgresS: BEGIN TRANSACTION Service->>PostgresS: "UPDATE users SET username WHERE id" Service->>PostgresS: COMMIT Note over Service: Invalidate caches Service->>Redis: "DEL user key" Service->>Redis: "DEL user username key" Service-->>Gateway: Updated user data Gateway-->>Client: "200 OK + updated user"

Характеристики: - Cache invalidation: Удаление всех связанных cache keys - Consistency: Write-through pattern (сначала DB, потом cache)

Performance Optimizations

1. Connection Pooling

PostgreSQL (через PgBouncer):

# sqlalchemy-postgres-kit
async_engine = create_async_engine(
    DATABASE_URL,
    pool_size=20,              # Max connections per pod
    max_overflow=10,           # Burst capacity
    pool_pre_ping=True,        # Health check before use
    pool_recycle=3600,         # Recycle connections after 1h
)

Redis:

# redis-client-kit
redis_client = AsyncRedisClient(
    host="redis-master",
    port=6379,
    max_connections=50,
    socket_keepalive=True,
    health_check_interval=30,
)

2. Caching Strategy

Multi-level cache:

@cache(ttl=300, key_prefix="user")
async def get_user_by_id(user_id: UUID) -> User | None:
    """
    1. Check Redis cache
    2. If miss, query PostgreSQL
    3. Store result in Redis
    """
    cached = await redis.get(f"user:{user_id}")
    if cached:
        return User.parse_raw(cached)

    user = await db.query(UserDB).filter_by(id=user_id).first()
    if user:
        await redis.setex(
            f"user:{user_id}",
            ttl=300,
            value=user.to_domain().json()
        )

    return user.to_domain() if user else None

Cache invalidation patterns: - Write-through: DB write → cache invalidate - TTL-based: Auto-expire после N секунд - Event-based: Kafka event → invalidate cache

3. Request Batching (DataLoader pattern)

from aiodataloader import DataLoader

class UserLoader(DataLoader):
    async def batch_load_fn(self, user_ids: list[UUID]) -> list[User | None]:
        """Load multiple users in single query"""
        users = await db.query(UserDB).filter(
            UserDB.id.in_(user_ids)
        ).all()

        user_map = {u.id: u for u in users}
        return [user_map.get(uid) for uid in user_ids]

# Usage in single request
user_loader = UserLoader()

# These calls are batched into single DB query
user1 = await user_loader.load(user_id_1)
user2 = await user_loader.load(user_id_2)
user3 = await user_loader.load(user_id_3)

4. Read Replicas (planned)

Observability

Request Tracing

OpenTelemetry spans:

from opentelemetry import trace

tracer = trace.get_tracer(__name__)

@tracer.start_as_current_span("get_user")
async def get_user(user_id: UUID) -> User:
    with tracer.start_as_current_span("cache.get"):
        cached = await redis.get(f"user:{user_id}")

    if not cached:
        with tracer.start_as_current_span("db.query"):
            user = await db.query(UserDB).get(user_id)

        with tracer.start_as_current_span("cache.set"):
            await redis.setex(f"user:{user_id}", 300, user.json())

    return user

Metrics

Prometheus metrics:

from prometheus_client import Histogram, Counter

request_duration = Histogram(
    'http_request_duration_seconds',
    'HTTP request latency',
    ['method', 'endpoint', 'status']
)

cache_hits = Counter(
    'cache_hits_total',
    'Number of cache hits',
    ['cache_type', 'key_prefix']
)

cache_misses = Counter(
    'cache_misses_total',
    'Number of cache misses',
    ['cache_type', 'key_prefix']
)

Error Handling & Retry

Circuit Breaker

from circuitbreaker import circuit

@circuit(failure_threshold=5, recovery_timeout=60)
async def call_external_service():
    """Auto-open circuit after 5 failures"""
    response = await http_client.post(...)
    return response

Retry with Exponential Backoff

from tenacity import retry, stop_after_attempt, wait_exponential

@retry(
    stop=stop_after_attempt(3),
    wait=wait_exponential(multiplier=1, min=1, max=10)
)
async def query_database():
    """Retry DB query up to 3 times"""
    return await db.query(...)

Связанные страницы

Event Flow — асинхронные потоки данных через Kafka
Data Ownership — кто владеет какими данными
API Design Principles — REST API guidelines

Потоки данных Пакетный поток данных

На странице

Архитектура Типы online потоков 1. REST API (Public) 2. gRPC (Internal) Детальные flow patterns Pattern 1: User Registration Pattern 2: Authentication (Login) Pattern 3: Authenticated API Request Pattern 4: Data Mutation with Cache Invalidation Performance Optimizations 1. Connection Pooling 2. Caching Strategy 3. Request Batching (DataLoader pattern) 4. Read Replicas (planned) Observability Request Tracing Metrics Error Handling & Retry Circuit Breaker Retry with Exponential Backoff Связанные страницы