For some use cases it might be useful to have access to the current job ID or instance from within the job function itself, or to store arbitrary data on jobs.
The life-cycle of a job consists of a few phases:

1. When `queue.enqueue(foo)` is called, a `Job` will be created and its ID pushed into the queue. `job.get_status()` will return `queued`.
2. When a worker picks up the job, `job.get_status()` will return `started`. In this phase an `Execution` object will be created and its `composite_key` put in `StartedJobRegistry`.
3. When the job finishes, the `execution` will be removed from `StartedJobRegistry`. A `Result` object that holds the result of the execution will be created. Both the `Job` and `Result` keys will persist in Redis until the value of `result_ttl` is up. More details in the Results documentation.

The status of a job can be one of the following:

- `queued`: The default status for created jobs, except for those that have dependencies, which will be created as `deferred`. These jobs have been placed in a queue and are ready to be executed.
- `finished`: The job has finished execution and is available through the finished job registry.
- `failed`: Jobs that encountered errors during execution or expired before being executed.
- `started`: The job has started execution. This status includes the job execution support mechanisms, such as setting the worker name and setting up heartbeat information.
- `deferred`: The job is not ready for execution because its dependencies have not finished successfully yet.
- `scheduled`: Jobs created to run at a future date or jobs that are retried after a retry interval.
- `stopped`: The job was stopped because the worker was stopped.
- `canceled`: The job has been manually canceled and will not be executed, even if it is part of a dependency chain.

These statuses can also be accessed from the job object using boolean properties, such as `job.is_finished`, as sketched below.
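For example, a freshly created job starts out `queued` (a minimal sketch, assuming a reachable Redis server and the `say_hello` function from the `hello` module used later in this page):

from redis import Redis
from rq import Queue

from hello import say_hello

q = Queue(connection=Redis())
job = q.enqueue(say_hello)

print(job.get_status())  # 'queued'
print(job.is_queued)     # True; each status has a matching boolean property
print(job.is_finished)   # False until a worker has processed the job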
When you enqueue a function, a job will be returned. You may then access the `id` property, which can later be used to retrieve the job.
from rq import Queue
from redis import Redis
from somewhere import count_words_at_url
redis_conn = Redis()
q = Queue(connection=redis_conn) # no args implies the default queue
# Delay execution of count_words_at_url('http://nvie.com')
job = q.enqueue(count_words_at_url, 'http://nvie.com')
print('Job id: %s' % job.id)
Or if you want a predetermined job id, you may specify it when creating the job.
job = q.enqueue(count_words_at_url, 'http://nvie.com', job_id='my_job_id')
A job can also be created directly with `Job.create()`.

from rq.job import Job

job = Job.create(count_words_at_url, args=('http://nvie.com',), connection=redis_conn)
print('Job id: %s' % job.id)
q.enqueue_job(job)

# create a job with a predetermined id
job = Job.create(count_words_at_url, args=('http://nvie.com',), id='my_job_id', connection=redis_conn)
The keyword arguments accepted by `create()` are:

- `timeout` specifies the maximum runtime of the job before it's interrupted and marked as `failed`. Its default unit is seconds and it can be an integer or a string representing an integer (e.g. `2`, `'2'`). Furthermore, it can be a string with a unit suffix for hours, minutes or seconds (e.g. `'1h'`, `'3m'`, `'5s'`).
- `result_ttl` specifies how long (in seconds) successful jobs and their results are kept. Expired jobs will be automatically deleted. Defaults to 500 seconds.
- `ttl` specifies the maximum queued time (in seconds) of the job before it's discarded. This argument defaults to `None` (infinite TTL).
- `failure_ttl` specifies how long (in seconds) failed jobs are kept (defaults to 1 year).
- `depends_on` specifies another job (or job id) that must complete before this job will be queued.
- `id` allows you to manually specify this job's id.
- `description` adds an additional description to the job.
- `connection`
- `status`
- `origin` where this job was originally enqueued.
- `meta` a dictionary holding custom status information on this job.
- `args` and `kwargs`: use these to explicitly pass arguments and keyword arguments to the underlying job function. This is useful if your function happens to have argument names that conflict with RQ's, for example `description` or `ttl`.

In the last case, that is, if you want to pass `description` and `ttl` keyword arguments to your job and not to RQ's enqueue function, this is what you do:
job = Job.create(count_words_at_url,
                 ttl=30,  # This ttl will be used by RQ
                 args=('http://nvie.com',),
                 kwargs={
                     'description': 'Function description',  # This is passed on to count_words_at_url
                     'ttl': 15  # This is passed on to count_words_at_url function
                 })
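Similarly, `timeout` accepts either numeric seconds or a unit-suffixed string, as described above. A small sketch showing equivalent spellings (`sleep_task` is a hypothetical function used only for illustration):

import time

def sleep_task(seconds):
    time.sleep(seconds)

job = Job.create(sleep_task, args=(90,), timeout=120, connection=redis_conn)    # integer seconds
job = Job.create(sleep_task, args=(90,), timeout='120', connection=redis_conn)  # string of seconds
job = Job.create(sleep_task, args=(90,), timeout='2m', connection=redis_conn)   # two minutes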
All job information is stored in Redis. You can inspect a job and its attributes by using `Job.fetch()`.
from redis import Redis
from rq.job import Job
redis = Redis()
job = Job.fetch('my_job_id', connection=redis)
print('Status: %s' % job.get_status())
Some interesting job attributes include:
- `job.get_status(refresh=True)` Possible values are `queued`, `started`, `deferred`, `finished`, `stopped`, `scheduled`, `canceled` and `failed`. If `refresh` is `True`, fresh values are fetched from Redis.
- `job.get_meta(refresh=True)` Returns custom `job.meta` dict containing user stored data. If `refresh` is `True`, fresh values are fetched from Redis.
- `job.origin` queue name of this job.
- `job.func_name`
- `job.args` arguments passed to the underlying job function.
- `job.kwargs` keyword arguments passed to the underlying job function.
- `job.result` stores the return value of the job being executed; will return `None` prior to job execution. Results are kept according to the `result_ttl` parameter (500 seconds by default).
- `job.enqueued_at`
- `job.started_at`
- `job.ended_at`
- `job.exc_info` stores exception information if the job doesn't finish successfully.
- `job.last_heartbeat` the latest timestamp that's periodically updated when the job is executing. Can be used to determine if the job is still active.
- `job.worker_name` returns the name of the worker currently executing this job.
- `job.refresh()` updates the job instance object with values fetched from Redis.

If you want to efficiently fetch a large number of jobs, use `Job.fetch_many()`.
jobs = Job.fetch_many(['foo_id', 'bar_id'], connection=redis)
for job in jobs:
    print('Job %s: %s' % (job.id, job.func_name))
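Because attribute values are read once and cached on the job instance, code that polls a job typically combines `Job.fetch()` with `job.refresh()`. A minimal sketch (assuming a job with ID `'my_job_id'` exists, reusing the `redis` connection from above):

import time

job = Job.fetch('my_job_id', connection=redis)
while not (job.is_finished or job.is_failed):
    time.sleep(1)
    job.refresh()  # re-read attribute values from Redis
print(job.result)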
New in 2.0
When a job is being executed, RQ stores its execution data in Redis. You can access this data via `Execution` objects.
from redis import Redis
from rq.job import Job
redis = Redis()
job = Job.fetch('my_job_id', connection=redis)
executions = job.get_executions() # Returns all current executions
execution = job.get_executions()[0] # Retrieves a single execution
print(execution.created_at) # When did this execution start?
print(execution.last_heartbeat) # Worker's last heartbeat
`Execution` objects have a few properties:

- `id`: ID of an execution.
- `job`: the `Job` object that owns this execution instance.
- `composite_key`: a combination of `job.id` and `execution.id`, formatted as `<job_id>:<execution_id>`.
- `created_at`: returns a datetime object representing the start of this execution.
- `last_heartbeat`: the worker's last heartbeat.

New in version 1.7.0
You can use `send_stop_job_command()` to tell a worker to immediately stop a currently executing job. A job that's stopped will be sent to `FailedJobRegistry`.
from redis import Redis
from rq.command import send_stop_job_command
redis = Redis()
# This will raise an exception if job is invalid or not currently executing
send_stop_job_command(redis, job_id)
Unlike failed jobs, stopped jobs will not be automatically retried even if retry is configured. Subclasses of `Worker` that override `handle_job_failure()` should likewise take care to handle jobs with a `stopped` status appropriately; a rough sketch of such an override follows.
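The exact signature of `handle_job_failure()` may vary between RQ versions, so treat the following only as an outline of the shape such an override could take, not a drop-in implementation:

from rq import Worker
from rq.job import JobStatus

class MyWorker(Worker):
    def handle_job_failure(self, job, queue, **kwargs):
        # Stopped jobs also pass through the failure handler; treat them
        # differently from genuine failures (e.g. skip alerting or retries).
        if job.get_status() == JobStatus.STOPPED:
            pass  # custom handling for stopped jobs goes here
        super().handle_job_failure(job, queue, **kwargs)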
New in version 1.10.0
To prevent a job from running, cancel it with `job.cancel()`.
from redis import Redis
from rq.job import Job
from rq.registry import CanceledJobRegistry
redis = Redis()
job = Job.fetch('my_job_id', connection=redis)
job.cancel()
job.get_status() # Job status is CANCELED
registry = CanceledJobRegistry(job.origin, connection=job.connection)
print(job in registry) # Job is in CanceledJobRegistry
Canceling a job sets its status to `CANCELED` and puts it into `CanceledJobRegistry`.
Note that `job.cancel()` does not delete the job itself from Redis. If you want to delete the job from Redis and reclaim memory, use `job.delete()`.
Note: if you want to enqueue the dependents of the job you are trying to cancel, use the following:
from rq import cancel_job

cancel_job(
    '2eafc1e6-48c2-464b-a0ff-88fd199d039c',
    connection=redis,
    enqueue_dependents=True
)
When creating a job or queue, you can pass in a custom serializer that will be used for serializing / deserializing job arguments. Serializers used should have at least `loads` and `dumps` methods. The default serializer used is `pickle`.
from redis import Redis
from rq import Queue
from rq.job import Job
from rq.serializers import JSONSerializer

connection = Redis()
job = Job(id="my-job", connection=connection, serializer=JSONSerializer)
queue = Queue(connection=connection, serializer=JSONSerializer)
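Any object that exposes compatible `dumps` and `loads` callables works. As an illustration, here is a hypothetical serializer (not part of RQ) that JSON-encodes job data and compresses it:

import json
import zlib

class CompressedJSONSerializer:
    """Hypothetical serializer: JSON-encode, then zlib-compress."""

    @staticmethod
    def dumps(obj):
        return zlib.compress(json.dumps(obj).encode('utf-8'))

    @staticmethod
    def loads(data):
        return json.loads(zlib.decompress(data).decode('utf-8'))

queue = Queue(connection=connection, serializer=CompressedJSONSerializer)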
Since job functions are regular Python functions, you must retrieve the job in order to inspect or update the job’s attributes. To do this from within the function, you can use:
from rq import get_current_job

def add(x, y):
    job = get_current_job()
    print('Current job: %s' % (job.id,))
    return x + y
Note that calling `get_current_job()` outside of the context of a job function will return `None`.
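This means the same function can also be called directly, outside of a worker; a `None` check keeps it usable in both contexts (a minimal sketch):

from rq import get_current_job

def add(x, y):
    job = get_current_job()
    if job is not None:  # None when called outside a worker, e.g. in unit tests
        print('Current job: %s' % job.id)
    return x + y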
Improved in 0.8.0.
To add/update custom status information on this job, you have access to the `meta` property, which allows you to store arbitrary pickleable data on the job itself:
import socket
import time

from rq import get_current_job

def add(x, y):
    job = get_current_job()
    job.meta['handled_by'] = socket.gethostname()
    job.save_meta()

    # do more work
    time.sleep(1)
    return x + y
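Elsewhere, for example in monitoring code, the stored value can be read back with `get_meta()`. A sketch, assuming the job's ID is at hand:

from redis import Redis
from rq.job import Job

job = Job.fetch('my_job_id', connection=Redis())
print(job.get_meta().get('handled_by'))  # hostname stored by the job above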
A job has two TTLs: one for the job result, `result_ttl`, and one for the job itself, `ttl`. The latter is used if you have a job that shouldn't be executed after a certain amount of time.
# When creating the job:
job = Job.create(func=say_hello,
                 result_ttl=600,  # how long (in seconds) to keep the job (if successful) and its results
                 ttl=43,  # maximum queued time (in seconds) of the job before it's discarded
                 )

# or when queueing a new job:
job = q.enqueue(count_words_at_url,
                'http://nvie.com',
                result_ttl=600,  # how long to keep the job (if successful) and its results
                ttl=43  # maximum queued time
                )
For user feedback or debugging it is possible to get the position of a job within the work queue, which allows you to track a job's progress through the queue. Note that this function iterates over all jobs in the queue and therefore performs poorly on very large queues.
from rq import Queue
from redis import Redis
from hello import say_hello
redis_conn = Redis()
q = Queue(connection=redis_conn)
job = q.enqueue(say_hello)
job2 = q.enqueue(say_hello)
job2.get_position()
# returns 1
q.get_job_position(job)
# returns 0
If a job fails during execution, the worker will put the job in a `FailedJobRegistry`. On the `Job` instance, the `is_failed` property will be true. `FailedJobRegistry` can be accessed through `queue.failed_job_registry`.
from redis import Redis
from rq import Queue, Worker

def div_by_zero(x):
    return x / 0

connection = Redis()
queue = Queue(connection=connection)
job = queue.enqueue(div_by_zero, 1)
registry = queue.failed_job_registry

worker = Worker([queue], connection=connection)
worker.work(burst=True)

assert len(registry) == 1  # Failed jobs are kept in FailedJobRegistry
By default, failed jobs are kept for 1 year. You can change this by specifying `failure_ttl` (in seconds) when enqueueing jobs.
job = queue.enqueue(foo_job, failure_ttl=300) # 5 minutes in seconds
If you need to manually requeue failed jobs, here’s how to do it:
from redis import Redis
from rq import Queue
connection = Redis()
queue = Queue(connection=connection)
registry = queue.failed_job_registry
# This is how to get jobs from FailedJobRegistry
for job_id in registry.get_job_ids():
    registry.requeue(job_id)  # Puts job back in its original queue

assert len(registry) == 0  # Registry will be empty when job is requeued
Starting from version 1.5.0, RQ also allows you to automatically retry failed jobs.
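As a quick illustration, a retry policy is attached when the job is enqueued, via the `Retry` class (a minimal sketch, reusing `queue` and `foo_job` from above):

from rq import Retry

# Retry failed executions up to 3 times, waiting 60 seconds between attempts
queue.enqueue(foo_job, retry=Retry(max=3, interval=60))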
RQ also provides a CLI tool that makes requeuing failed jobs easy.
# This will requeue foo_job_id and bar_job_id from myqueue's failed job registry
rq requeue --queue myqueue -u redis://localhost:6379 foo_job_id bar_job_id
# This command will requeue all jobs in myqueue's failed job registry
rq requeue --queue myqueue -u redis://localhost:6379 --all