RTOS debugging, part 2: Your tasks need that CPU; don’t starve them of it

September 5, 2017 Dr. Johan Kraft, Percepio AB

In embedded systems using multitasking, you may run into a situation where some of your tasks run slowly or not at all. This is called task starvation (the affected tasks are starved of CPU time) and it can happen for a number of reasons. Today we will look at one particular reason – a fixed priority schedule where the task priorities happen to be wrong, or at least unsuitable.

In a scenario with fixed priorities, the developer needs to assign a runtime priority for each task when it is created. High-priority tasks should always execute before tasks with lower priority, but when the low-priority tasks receive too little CPU time to do their jobs, you may have a starvation problem. This can manifest itself in many ways. For example, the device may work perfectly most of the time, but sometimes it becomes unresponsive.

A naïve fix would be to just raise priorities of the affected tasks; this kind of thinking would ultimately render prioritization useless, since every task would become “highest priority”. Instead, be very careful about what tasks receive high scheduling priority: reserve it for tasks that are predictable, execute in a recurring pattern like "every x milliseconds", and ideally also have a short execution time compared to the recurring interval.

It follows that handlers for external events, like a user pressing a button or network traffic, should execute with relatively low priority since these are almost by definition irregular events and the handlers can be invoked at any time. And to no-one’s great surprise, tasks with a long execution time should also execute at low priority.

"But wait," you may be thinking by now, "I have time-critical interrupt service routines that require a lot of CPU time to move all their data." The best solution in that case is to use hardware features like DMA to offload the processor core, but otherwise it is often a good idea to split such processing into several tasks and use different priorities at different stages. Isolate the really time-critical stuff in a small, high-priority task and have that task alert a medium-priority task to do the bulk of the processing. You may also have a third stage, implemented in a task that runs at low priority.

It is very important to put tasks to sleep when they have no more work to do. You should avoid polling and other forms of busy wait – after all, that amounts to little more than a waste of processor time. After completion, your task should delay itself or wait for an RTOS event (e.g., a semaphore signal), as this will allow the RTOS to schedule tasks with lower priority while the task is waiting. The sleeping task is revived by the RTOS when suitable.

If you have a really critical system, you may want to look into something called rate-monotonic scheduling (RMS). That is a formal technique for calculating the response times resulting from a set of tasks, given their respective execution times. During the analysis, tasks are assigned priorities based on their execution time, so that the quickest task runs with the highest priority.

RMS can give you a formal guarantee that task starvation will not happen or, seen from the other angle, tell you how much processing capacity your system needs to avoid starvation. The flip side is that you need to have really good information about your tasks – execution times, periodicity, etc. – to carry out your analysis.

Dr. Johan Kraft is CEO and CTO of Percepio AB, a Swedish tool company developing visual analysis tools that accelerate embedded software development. Dr. Kraft holds a Ph.D. in Computer Science and his academic work up until 2010 focused on practical methods for timing analysis of embedded software, performed in close collaboration with regional software-oriented industry. Before his doctoral studies, Dr. Kraft started his career as an embedded software developer, working with control software for industrial robots.

Previous Article
Shift-left your safety-critical software testing with test automation
Shift-left your safety-critical software testing with test automation

The trend in major safety-critical projects shows exponential growth in total costs, with software being a ...

Next Article
RTOS debugging, part 1: Dealing with timing issues
RTOS debugging, part 1: Dealing with timing issues

There are several reasons that debugging an RTOS system is different from debugging a single-threaded “supe...

×

Stay updated on Developer Tools & Operating Systems with the Dev Tools and OS edition of our Embedded Daily newsletter

Subscribed! Look for 1st copy soon.
Error - something went wrong!