tHe puZZlingG wOrlD: Top and Bottom Halves

Source

10.4. Top and Bottom Halves

One of the main problems with interrupt handling is how to perform lengthy tasks

within a handler. Often a substantial amount of work must be done in response to

a device interrupt, but interrupt handlers need to finish up quickly and not keep

interrupts blocked for long. These two needs (work and speed) conflict with each

other, leaving the driver writer in a bit of a bind. Linux (along with many other

systems) resolves this problem by splitting the interrupt handler into two halves.

The so-called top half is the routine that actually responds to the interrupt—the

one you register with request_irq. The bottom half is a routine that is scheduled

by the top half to be executed later, at a safer time. The big difference between the

top-half handler and the bottom half is that all interrupts are enabled during

execution of the bottom half—that's why it runs at a safer time. In the typical

scenario, the top half saves device data to a device-specific buffer, schedules its

bottom half, and exits: this operation is very fast. The bottom half then performs

whatever other work is required, such as awakening processes, starting up another

I/O operation, and so on. This setup permits the top half to service a new interrupt

while the bottom half is still working.

Almost every serious interrupt handler is split this way. For instance, when a

network interface reports the arrival of a new packet, the handler just retrieves

the data and pushes it up to the protocol layer; actual processing of the packet

is performed in a bottom half.

The Linux kernel has two different mechanisms that may be used to implement

bottom-half processing, both of which were introduced in Chapter 7. Tasklets are

often the preferred mechanism for bottom-half processing; they are very fast, but

all tasklet code must be atomic. The alternative to tasklets is workqueues, which

may have a higher latency but that are allowed to sleep.

The following discussion works, once again, with the short driver. When loaded with

a module option, short can be told to do interrupt processing in a top/bottom-half

mode with either a tasklet or workqueue handler. In this case, the top half executes

quickly; it simply remembers the current time and schedules the bottom half

processing. The bottom half is then charged with encoding this time and awakening

any user processes that may be waiting for data.

10.4.1. Tasklets

Remember that tasklets are a special function that may be scheduled to run, in

software interrupt context, at a system-determined safe time. They may be

scheduled to run multiple times, but tasklet scheduling is not cumulative; the

tasklet runs only once, even if it is requested repeatedly before it is launched.

No tasklet ever runs in parallel with itself, since they run only once, but tasklets

can run in parallel with other tasklets on SMP systems. Thus, if your driver has

multiple tasklets, they must employ some sort of locking to avoid conflicting

with each other.

Tasklets are also guaranteed to run on the same CPU as the function that first

schedules them. Therefore, an interrupt handler can be secure that a tasklet does

not begin executing before the handler has completed. However, another interrupt

can certainly be delivered while the tasklet is running, so locking between the

tasklet and the interrupt handler may still be required.

Tasklets must be declared with the DECLARE_TASKLET macro:

DECLARE_TASKLET(name, function, data);

name is the name to be given to the tasklet, function is the function that is called to

execute the tasklet (it takes one unsigned long argument and returns void), and

data is an unsigned long value to be passed to the tasklet function.

The short driver declares its tasklet as follows:

void short_do_tasklet(unsigned long);
DECLARE_TASKLET(short_tasklet, short_do_tasklet, 0);

The function tasklet_schedule is used to schedule a tasklet for running.

If short is loaded with tasklet=1, it installs a different interrupt handler that

saves data and schedules the tasklet as follows:

irqreturn_t short_tl_interrupt(int irq, void *dev_id, struct pt_regs *regs)
{

/* cast to stop 'volatile' warning */
do_gettimeofday((struct timeval *) tv_head);

short_incr_tv(&tv_head);
    tasklet_schedule(&short_tasklet);
    short_wq_count++; /* record that an interrupt arrived */
    return IRQ_HANDLED;
}

The actual tasklet routine, short_do_tasklet, will be executed shortly

(so to speak) at the system's convenience. As mentioned earlier, this routine

performs the bulk of the work of handling the interrupt; it looks like this:

void short_do_tasklet (unsigned long unused)
{
    int savecount = short_wq_count, written;
    short_wq_count = 0; /* we have already been removed from the queue */
    /*
     * The bottom half reads the tv array, filled by the top half,
     * and prints it to the circular text buffer, which is then consumed
     * by reading processes
     */

    /* First write the number of interrupts that occurred before this bh */
    written = sprintf((char *)short_head,"bh after %6i\n",savecount);
    short_incr_bp(&short_head, written);

    /*
     * Then, write the time values. Write exactly 16 bytes at a time,
     * so it aligns with PAGE_SIZE
     */

    do {
        written = sprintf((char *)short_head,"%08u.%06u\n",
                (int)(tv_tail->tv_sec % 100000000),
                (int)(tv_tail->tv_usec));
        short_incr_bp(&short_head, written);
        short_incr_tv(&tv_tail);
    } while (tv_tail != tv_head);

    wake_up_interruptible(&short_queue); /* awake any reading process */
}

Among other things, this tasklet makes a note of how many interrupts have

arrived since it was last called. A device such as short can generate a great

many interrupts in a brief period, so it is not uncommon for several to arrive

before the bottom half is executed. Drivers must always be prepared for this

possibility and must be able to determine how much work there is to perform

from the information left by the top half.

10.4.2. Workqueues

Recall that workqueues invoke a function at some future time in the context

of a special worker process. Since the workqueue function runs in process

context, it can sleep if need be. You cannot, however, copy data into user

space from a workqueue, unless you use the advanced techniques we

demonstrate in Chapter 15; the worker process does not have access to any

other process's address space.

The short driver, if loaded with the wq option set to a nonzero value, uses a

workqueue for its bottom-half processing. It uses the system default workqueue,

so there is no special setup code required; if your driver has special latency

requirements (or might sleep for a long time in the workqueue function), you

may want to create your own, dedicated workqueue. We do need awork_struct

structure, which is declared and initialized with the following:

static struct work_struct short_wq;

    /* this line is in short_init(  ) */
    INIT_WORK(&short_wq, (void (*)(void *)) short_do_tasklet, NULL);

Our worker function is short_do_tasklet, which we have already seen in

the previous section.

When working with a workqueue, short establishes yet another interrupt

handler that looks like this:

irqreturn_t short_wq_interrupt(int irq, void *dev_id, struct pt_regs *regs)
{
    /* Grab the current time information. */
    do_gettimeofday((struct timeval *) tv_head);
    short_incr_tv(&tv_head);

    /* Queue the bh. Don't worry about multiple enqueueing */
    schedule_work(&short_wq);

    short_wq_count++; /* record that an interrupt arrived */
    return IRQ_HANDLED;
}

As you can see, the interrupt handler looks very much like the tasklet

version, with the exception that it calls schedule_work to arrange the

bottom-half processing.

tHe puZZlingG wOrlD

Saturday, February 4, 2012

Top and Bottom Halves

10.4. Top and Bottom Halves

10.4.1. Tasklets

10.4.2. Workqueues

No comments:

Post a Comment