Job Management

This guide covers how to monitor and manage background jobs in Cascadia PLM.

Overview

Cascadia uses a RabbitMQ-backed background job system for async processing. Jobs handle tasks like:

Workflow Notifications: Sending email/webhook notifications on workflow transitions
Bulk Operations: Processing large data sets asynchronously
Scheduled Tasks: Recurring maintenance operations

Administrators can monitor job status, retry failed jobs, and cancel pending jobs from the Admin dashboard.

Accessing Job Management

Navigate to Admin > Jobs from the main navigation
The dashboard displays real-time job statistics and a filterable job list

Job Dashboard

Statistics Cards

The top of the page shows count cards for:

Metric	Description
Total	All jobs in the system
Pending	Jobs waiting to be queued
Queued	Jobs in the RabbitMQ queue
Running	Jobs currently being processed
Completed	Successfully finished jobs
Failed	Jobs that encountered errors

Auto-Refresh

Toggle Auto-refresh to enable 5-second polling for real-time updates. Useful when monitoring active job processing.

Job List

The data grid displays all jobs with the following columns:

Column	Description
Type	Job type identifier (e.g., `notification.workflow.transition`)
Status	Current status with color-coded badge
Priority	Job priority level (Low, Normal, High, Critical)
Progress	Visual progress bar with percentage
Attempts	Current attempt count vs. maximum (e.g., "2/3")
Created	Timestamp when job was submitted
Error	Error message if failed (hover for full text)

Status Colors

Status	Color	Description
pending	Gray	Waiting to be queued
queued	Blue	In RabbitMQ queue
running	Amber	Currently processing
completed	Green	Finished successfully
failed	Red	Encountered an error
cancelled	Gray	Manually cancelled

Filtering

Use the filter controls to narrow down jobs by:

Status: Show only jobs in a specific state
Type: Filter by job type identifier

Job Actions

Click the ... menu on any job row to access available actions:

View Details

Opens a modal with comprehensive job information:

Basic Information:

Type, Status, Priority
Attempt count and maximum attempts
Creation and completion timestamps

Payload Section:

JSON display of the job's input data
Useful for debugging what parameters were passed

Result Section:

JSON display of the job's output data (completed jobs only)
Shows what the job produced

Logs Section:

Structured log entries with timestamps
Color-coded by level (debug, info, warn, error)
Includes any additional data attached to log entries

Error Section (failed jobs):

Full error message displayed in a code block
Stack trace if available

Retry Job

Available only for failed jobs.

Click Retry Job from the row menu
The job is reset:
- Status changed to pending
- Attempt counter reset to 0
- Error and result cleared
Job is re-queued for processing

Use this when:

A transient error caused the failure (network timeout, service unavailable)
You've fixed the underlying issue
You want to force immediate reprocessing

Cancel Job

Available only for pending or queued jobs.

Click Cancel Job from the row menu
Confirm the cancellation in the dialog
Job is marked as cancelled with a completion timestamp

Use this when:

A job was submitted incorrectly
The task is no longer needed
You need to prevent processing before it starts

Note: You cannot cancel jobs that are already running. Wait for them to complete or fail.

Job Lifecycle

┌─────────┐     ┌────────┐     ┌─────────┐     ┌───────────┐
│ Pending │ ──→ │ Queued │ ──→ │ Running │ ──→ │ Completed │
└─────────┘     └────────┘     └─────────┘     └───────────┘
     │               │               │
     │               │               ↓
     │               │         ┌────────┐
     │               │         │ Failed │ ←─── (can retry)
     │               │         └────────┘
     ↓               ↓
┌───────────┐
│ Cancelled │
└───────────┘

Automatic Retries

When a job fails, the system automatically schedules retries based on the job type's configuration:

Exponential Backoff: Each retry waits longer (e.g., 30s, 2min, 10min)
Max Attempts: Jobs stop retrying after reaching the configured limit
Retry Delays: Configurable per job type

Failed jobs that have exhausted all retry attempts remain in failed status until manually retried or deleted.

Job Priorities

Jobs are processed by priority order within the queue:

Priority	RabbitMQ Value	Use Case
Critical	9	System alerts, security notifications
High	6	User-facing notifications, time-sensitive tasks
Normal	3	Standard background work
Low	1	Maintenance, cleanup, reports

Higher priority jobs are processed before lower priority jobs when workers are busy.

Monitoring Best Practices

Regular Checks

Monitor Failed Jobs: Review failed jobs daily to identify recurring issues
Check Queue Depth: A growing queue may indicate worker capacity issues
Review Logs: Use job logs to debug failures without checking server logs

Troubleshooting Failures

View job details to see the full error message
Check the payload to verify correct input data
Review logs for step-by-step execution history
Check RabbitMQ UI (http://localhost:15672) for queue health

Performance Issues

If jobs are processing slowly:

Increase worker concurrency: Set WORKER_CONCURRENCY env var (default: 5)
Add more workers: Scale horizontally with additional worker containers
Check job types: Some job types may have rate limits or concurrency caps

API Reference

Administrators can also manage jobs programmatically:

List Jobs

GET /api/admin/jobs?status=failed&limit=50

Query parameters:

status - Filter by job status
type - Filter by job type
limit - Max results (default: 100, max: 500)
offset - Pagination offset

Get Job Details

GET /api/admin/jobs/{jobId}

Returns full job object including logs.

Retry Job

POST /api/admin/jobs/{jobId}/retry

Resets and re-queues a failed job.

Cancel Job

POST /api/admin/jobs/{jobId}/cancel

Cancels a pending or queued job.

Overview​

Accessing Job Management​

Job Dashboard​

Statistics Cards​

Auto-Refresh​

Job List​

Status Colors​

Filtering​

Job Actions​

View Details​

Retry Job​

Cancel Job​

Job Lifecycle​

Automatic Retries​

Job Priorities​

Monitoring Best Practices​

Regular Checks​

Troubleshooting Failures​

Performance Issues​

API Reference​

List Jobs​

Get Job Details​

Retry Job​

Cancel Job​

See Also​

Overview

Accessing Job Management

Job Dashboard

Statistics Cards

Auto-Refresh

Job List

Status Colors

Filtering

Job Actions

View Details

Retry Job

Cancel Job

Job Lifecycle

Automatic Retries

Job Priorities

Monitoring Best Practices

Regular Checks

Troubleshooting Failures

Performance Issues

API Reference

List Jobs

Get Job Details

Retry Job

Cancel Job

See Also