Modern Embedded Recipes · 68/152

RTOS Queue 활용 — By-Value·By-Reference·Timeout 패턴

2026년 4월 15일 · Hawk · 4분 읽기

#한 줄 요약

“Queue는 producer가 데이터를 복사해 넣고 consumer가 복사해 꺼내는 FIFO입니다.” 작은 데이터는 by-value, 큰 데이터는 by-pointer가 표준입니다.

#어떤 상황에서 쓰나

ISR이 ADC 샘플을 받아 task에 넘기고, BLE stack이 packet을 application에 전달하고, sensor task가 측정값을 logger에 보내는 모든 흐름이 queue입니다. Semaphore는 신호만 전달하지만 queue는 데이터까지 같이 보냅니다.

선택지가 있는 부분은 얼마나 큰 데이터를 어떻게 넘길지입니다. 항상 by-value로 보내면 큰 buffer를 매번 복사하느라 CPU가 새고, 항상 by-pointer로 보내면 lifetime과 ownership을 잘못 설계해 use-after-free가 발생합니다.

#핵심 개념

API	동작
`xQueueCreate(N, sizeof(T))`	N개 슬롯, 항목 크기 `sizeof(T)`
`xQueueSend`	뒤에 추가 (by value, 복사)
`xQueueSendToFront`	앞에 추가 (긴급 신호)
`xQueueReceive`	앞에서 제거 (by value, 복사)
`xQueuePeek`	제거하지 않고 읽기

Queue는 내부에서 memcpy(slot, &item, sizeof(T))를 합니다. 그래서 sizeof(T)가 크면 send 비용이 그만큼 늘어납니다.

방식	적합한 경우
by-value	T가 32 B 이하 — 단순, 안전, lifetime 걱정 없음
by-pointer	T가 큼 — pool에서 받아 pointer만 send
zero-copy	DMA buffer를 미리 할당, index만 queue로 전달

Backpressure는 queue가 full일 때 producer가 어떻게 행동할지의 정책입니다. block, drop, replace 세 가지 중 선택합니다.

#코드 / 실제 사용 예

#By-value (작은 메시지)

1
typedef struct { uint32_t ts; int16_t x, y, z; } sample_t;
2
QueueHandle_t samples;
3

4
void task_imu(void *arg) {
5
    sample_t s;
6
    for (;;) {
7
        read_accel(&s.x, &s.y, &s.z);
8
        s.ts = xTaskGetTickCount();
9
        if (xQueueSend(samples, &s, 0) != pdTRUE)
10
            stats.drop++;        /* full이면 즉시 drop */
11
    }
12
}
13

14
void task_logger(void *arg) {
15
    sample_t s;
16
    for (;;) {
17
        xQueueReceive(samples, &s, portMAX_DELAY);
18
        log_sample(&s);
19
    }
20
}
21

22
int main(void) {
23
    samples = xQueueCreate(64, sizeof(sample_t));
24
}

10 byte 정도의 sample은 by-value가 가장 단순합니다. lifetime 걱정이 없고 send 후 sender의 stack은 자유롭게 재사용할 수 있습니다.

#By-pointer (큰 메시지)

1
typedef struct { uint8_t data[1024]; size_t len; } pkt_t;
2

3
QueueHandle_t pkt_q;
4
static pkt_t pool[8];
5
SemaphoreHandle_t pool_sem;
6

7
pkt_t *pkt_alloc(void) {
8
    if (xSemaphoreTake(pool_sem, 0) != pdTRUE) return NULL;
9
    /* free list에서 하나 가져오기 */
10
    return pool_pop();
11
}
12

13
void pkt_free(pkt_t *p) {
14
    pool_push(p);
15
    xSemaphoreGive(pool_sem);
16
}
17

18
void task_rx(void *arg) {
19
    for (;;) {
20
        pkt_t *p = pkt_alloc();
21
        radio_read(p->data, &p->len);
22
        xQueueSend(pkt_q, &p, portMAX_DELAY);   /* pointer 한 word만 복사 */
23
    }
24
}
25

26
void task_parse(void *arg) {
27
    pkt_t *p;
28
    for (;;) {
29
        xQueueReceive(pkt_q, &p, portMAX_DELAY);
30
        parse(p);
31
        pkt_free(p);
32
    }
33
}

1 KB의 packet을 매번 복사하면 32-bit ARM에서도 수 µs가 듭니다. Pool에서 미리 할당해두고 pointer만 send하면 send 비용이 4 byte로 떨어집니다. 대신 lifetime을 명확히 관리해야 합니다.

#ISR에서 send

1
void UART_IRQHandler(void) {
2
    BaseType_t hp = pdFALSE;
3
    uint8_t byte = USART1->DR;
4
    xQueueSendFromISR(rx_q, &byte, &hp);
5
    portYIELD_FROM_ISR(hp);
6
}

*FromISR을 잊으면 critical section이 어긋납니다. ISR이 byte 단위로 push하고 task가 line 단위로 처리하는 패턴이 매우 흔합니다.

#Backpressure 정책 비교

1
/* drop on full — sender가 빠를 때 */
2
xQueueSend(q, &item, 0);
3

4
/* block on full — sender가 늦춰져도 됨 */
5
xQueueSend(q, &item, portMAX_DELAY);
6

7
/* replace on full — 최신 값만 의미 있음 */
8
xQueueOverwrite(latest_q, &item);    /* length=1 queue에만 */

센서 stream처럼 손실해도 되는 데이터는 drop이 적절합니다. command queue는 block이, “현재 값”만 의미 있는 telemetry는 overwrite가 적절합니다.

#Queue set으로 다중 입력 대기

1
QueueSetHandle_t qs = xQueueCreateSet(20);
2
xQueueAddToSet(cmd_q, qs);
3
xQueueAddToSet(timer_q, qs);
4

5
void task_dispatch(void *arg) {
6
    for (;;) {
7
        QueueHandle_t active = xQueueSelectFromSet(qs, portMAX_DELAY);
8
        if (active == cmd_q) {
9
            cmd_t c; xQueueReceive(cmd_q, &c, 0);
10
            handle_cmd(&c);
11
        } else if (active == timer_q) {
12
            tick_t t; xQueueReceive(timer_q, &t, 0);
13
            handle_tick(&t);
14
        }
15
    }
16
}

여러 queue를 동시에 기다리는 패턴입니다. Linux의 epoll에 해당합니다. 다만 queue set은 memory를 더 쓰니 정말 필요할 때만 씁니다.

#Stream buffer (가변 길이)

1
StreamBufferHandle_t sb = xStreamBufferCreate(1024, 1);
2

3
/* sender — 가변 길이 byte stream */
4
xStreamBufferSend(sb, "hello\n", 6, 0);
5

6
/* receiver */
7
char buf[64];
8
size_t n = xStreamBufferReceive(sb, buf, sizeof(buf), portMAX_DELAY);

UART나 USB CDC처럼 byte 단위로 들어오는 stream에는 queue보다 stream buffer가 자연스럽습니다.

#측정 / 성능 비교

1
연산                              시간 (Cortex-M4 72 MHz)
2
xQueueSend (sizeof 12 B)          1.4 µs
3
xQueueSend (sizeof 256 B)         3.2 µs  ← memcpy 비용
4
xQueueSend (pointer only, 4 B)    1.1 µs
5
xQueueReceive (block → wake)      6.8 µs
6
xStreamBufferSend (32 B)          1.3 µs

대형 메시지를 by-value로 보내면 send 비용이 빠르게 증가합니다. 64 byte를 넘으면 pointer 방식을 검토할 가치가 있습니다.

1
RAM 사용량
2
queue (N=64, item 12 B)           Queue 구조 + 768 B
3
queue (N=8, pointer 4 B)          Queue 구조 + 32 B + pool 8 KB

By-pointer는 pool RAM이 별도로 필요하지만, 전체로 보면 비슷하거나 더 작은 경우가 많습니다.

#자주 보는 함정

Lifetime 관리 실패 (by-pointer)

1
void send_msg(void) {
2
    char buf[64];               /* stack */
3
    sprintf(buf, "hello");
4
    xQueueSend(q, &buf, 0);     /* receiver가 받기 전에 buf 소멸 */
5
}

Stack 변수의 pointer를 send하면 receiver가 garbage를 읽습니다. Pool 또는 static buffer에서만 보내야 합니다.

Pool exhaustion 무시

1
pkt_t *p = pkt_alloc();
2
xQueueSend(q, &p, 0);    /* p가 NULL이면 receiver crash */

Pool이 비면 alloc이 NULL을 돌려줍니다. 항상 check하고 backpressure 처리를 정의해둡니다.

sizeof 실수

1
QueueHandle_t q = xQueueCreate(64, sizeof(pkt_t *));   /* 4 B per slot */
2
pkt_t p;
3
xQueueSend(q, &p, 0);    /* by-value인 줄 알았다면 첫 4 byte만 복사됨 */

Create와 send/receive의 sizeof가 어긋나면 silent corruption이 발생합니다. by-pointer queue임을 명시적으로 표시합니다.

Full에서 portMAX_DELAY로 sender block

1
xQueueSend(q, &item, portMAX_DELAY);   /* consumer가 죽으면 producer도 영구 block */

Backpressure 정책 없이 무한 대기하면 cascading failure가 발생합니다. timeout과 drop counter를 함께 둡니다.

#정리

Queue는 by-value memcpy가 기본입니다. sizeof가 클수록 send 비용이 늘어납니다.
64 byte 이상은 by-pointer + pool 패턴이 거의 항상 더 빠릅니다.
ISR은 xQueueSendFromISR과 portYIELD_FROM_ISR을 함께 씁니다.
Backpressure 정책(block, drop, replace)을 코드 단위로 명시합니다.
Queue set은 여러 입력을 한 task에서 대기할 때 씁니다.
Stream buffer는 byte stream에, queue는 record 단위에 적합합니다.
By-pointer는 lifetime과 pool exhaustion을 항상 확인합니다.

다음 편은 Event Group입니다. 여러 비트로 multi-condition wait을 다룹니다.

RTOS Queue 활용 — By-Value·By-Reference·Timeout 패턴

#한 줄 요약

#어떤 상황에서 쓰나

#핵심 개념

#코드 / 실제 사용 예

#By-value (작은 메시지)

#By-pointer (큰 메시지)

#ISR에서 send

#Backpressure 정책 비교

#Queue set으로 다중 입력 대기

#Stream buffer (가변 길이)

#측정 / 성능 비교

#자주 보는 함정

#정리

#관련 항목

Modern Embedded Recipes · 69 of 152

관련 글

MPMC Queue 구현 — Multi-producer Multi-consumer Lock-Free

RTOS 디버깅 기법 — Tracealyzer·SystemView·Stack 추적

RTOS Software Timer 활용 — One-shot·Auto-reload·Daemon Task

이 글을 참조하는 글 (4)