Practical RTOS Internals · 6/53

Synchronization Fundamentals: Critical Section, Mutual Exclusion, Race Condition

May 4, 2026 · Hawk · 5 min read

sync critical-section mutex race-condition atomic

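#One-Line Summary

“Shared data needs synchronization.” Race conditions are among the most common causes of intermittent bugs, and the key is protection that is both short and correct.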

#Race Condition: The Trickiest Bug

#include <stdint.h>
#include "FreeRTOS.h"
#include "task.h"

// Two tasks increment the same counter
volatile uint32_t counter = 0;

void task_a(void *arg) {
    while (1) { counter++; vTaskDelay(1); }
}

void task_b(void *arg) {
    while (1) { counter++; vTaskDelay(1); }
}

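counter++ looks like a single instruction, but on a load-store architecture such as ARM it compiles to three steps (pseudo-assembly; [counter] stands for the variable's address):

LDR  r0, [counter]    ; (1) read
ADD  r0, r0, #1       ; (2) increment
STR  r0, [counter]    ; (3) write

If a task is preempted between any of these three steps, the other task can read the same stale value, and one of the increments is lost.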

time →
Task A: read=5 . . . . . . . . . . . . . . . . . . increment=6, write=6
Task B: . . . . . . read=5 . . . increment=6, write=6 . . . . . .
Result: counter = 6 (should be 7)
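The interleaving in the timeline can be replayed deterministically on a host machine. The sketch below is only a model: `task_regs_t`, the `step_*` helpers, and `simulate_lost_update` are hypothetical names introduced for illustration, and each "task" is reduced to a private register copy.

```c
#include <stdint.h>

/* Hypothetical model: each "task" keeps its own register copy (r0). */
typedef struct {
    uint32_t r0;
} task_regs_t;

static void step_read(task_regs_t *t, const uint32_t *mem)  { t->r0 = *mem; }
static void step_increment(task_regs_t *t)                  { t->r0 += 1; }
static void step_write(const task_regs_t *t, uint32_t *mem) { *mem = t->r0; }

/* Replays the interleaving from the timeline and returns the final counter. */
uint32_t simulate_lost_update(uint32_t initial)
{
    uint32_t counter = initial;
    task_regs_t a, b;

    step_read(&a, &counter);       /* Task A: read                          */
    step_read(&b, &counter);       /* Task B preempts A and reads the same  */
    step_increment(&b);
    step_write(&b, &counter);      /* Task B: writes initial + 1            */
    step_increment(&a);            /* Task A resumes with its stale copy    */
    step_write(&a, &counter);      /* Task A: overwrites with the same value */

    return counter;
}
```

Calling `simulate_lost_update(5)` yields 6 rather than 7, matching the timeline: two increments ran, but only one survived.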

#Critical Section: The Protected Region

The goal is “this block of code executes atomically.” Three tools can achieve it.

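#1. Interrupt Disable

The strongest option: it blocks ISRs as well as other tasks. Keep it short (tens of µs at most).

__disable_irq();
counter++;
__enable_irq();   // re-enables unconditionally, so this form must not be nested

// Or use the FreeRTOS API (tracks nesting)
taskENTER_CRITICAL();
counter++;
taskEXIT_CRITICAL();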

#Pros and Cons

✓ Safe against ISRs as well
✓ No spinning and no context switch
✗ Increases ISR latency, which hurts real-time behavior
✗ Long work is off limits
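One detail worth seeing in code is nesting: if a function that disables interrupts calls another that does the same, the inner exit must not re-enable interrupts early. The sketch below models this on a host with a nesting counter. `irq_masked`, `critical_nesting`, and the `critical_*` functions are hypothetical stand-ins, not a real port layer; on hardware the boolean would be the actual interrupt mask.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-ins for the hardware interrupt mask and a nesting count. */
static bool     irq_masked = false;
static uint32_t critical_nesting = 0;

void critical_enter(void)
{
    irq_masked = true;          /* mask first, then count */
    critical_nesting++;
}

void critical_exit(void)
{
    if (critical_nesting == 0) {
        return;                 /* unbalanced exit: ignore in this sketch */
    }
    critical_nesting--;
    if (critical_nesting == 0) {
        irq_masked = false;     /* unmask only when leaving the outermost section */
    }
}

bool irq_is_masked(void)
{
    return irq_masked;
}
```

After two `critical_enter()` calls and one `critical_exit()`, `irq_is_masked()` still reports true; only the second exit unmasks.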

#2. Spinlock (SMP)

Works across multiple cores by busy-waiting.

spin_lock(&lock);
shared_data = value;
spin_unlock(&lock);

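#Pros and Cons

✓ Efficient for short critical sections
✓ No context switch, so latency stays predictable as long as hold times are short
✗ Only meaningful on SMP; on a single core it is pointless
✗ If the hold time grows, the other cores burn cycles spinning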
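For readers who want to see what such a lock might look like inside, here is a minimal test-and-set spinlock built on C11 `atomic_flag`. The type `spinlock_t` and the three functions are illustrative definitions written for this sketch, not the API of any particular kernel; a production lock would usually add a pause/yield hint and often interrupt masking as well.

```c
#include <stdatomic.h>
#include <stdbool.h>

typedef struct {
    atomic_flag locked;
} spinlock_t;

/* Returns true if the lock was acquired without waiting. */
bool spin_trylock(spinlock_t *l)
{
    /* test_and_set returns the previous value: false means we got the lock. */
    return !atomic_flag_test_and_set_explicit(&l->locked, memory_order_acquire);
}

void spin_lock(spinlock_t *l)
{
    while (!spin_trylock(l)) {
        /* busy-wait until the holder clears the flag */
    }
}

void spin_unlock(spinlock_t *l)
{
    /* Release ordering publishes the protected writes to the next holder. */
    atomic_flag_clear_explicit(&l->locked, memory_order_release);
}
```

A lock is declared as `spinlock_t lock = { ATOMIC_FLAG_INIT };`. The acquire/release pairing is what makes writes inside the critical section visible to whichever core takes the lock next.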

#3. Mutex (Between Tasks)

A blocking primitive: a waiting task moves to the Blocked state.

xSemaphoreTake(mutex, portMAX_DELAY);   // block until the mutex is available
shared_data = value;
xSemaphoreGive(mutex);

#Pros and Cons

✓ Long critical sections are acceptable
✓ Supports priority inheritance (the fix applied to the Mars Pathfinder priority inversion)
✗ Has context switch overhead
✗ Cannot be used from an ISR
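Priority inheritance is easier to reason about with a toy model. The sketch below is not the FreeRTOS implementation; `toy_task_t`, `toy_mutex_t`, and both functions are hypothetical, and blocking is represented simply by returning false. It assumes a higher number means a higher priority.

```c
#include <stdbool.h>
#include <stddef.h>

/* Toy model with hypothetical names; not the FreeRTOS data structures. */
typedef struct {
    int base_priority;      /* priority assigned at creation */
    int current_priority;   /* may be raised temporarily     */
} toy_task_t;

typedef struct {
    toy_task_t *holder;     /* NULL when the mutex is free   */
} toy_mutex_t;

/* Returns true if taken; false means the caller would have to block. */
bool toy_mutex_take(toy_mutex_t *m, toy_task_t *caller)
{
    if (m->holder == NULL) {
        m->holder = caller;
        return true;
    }
    /* Contended: lend the caller's priority to a lower-priority holder. */
    if (caller->current_priority > m->holder->current_priority) {
        m->holder->current_priority = caller->current_priority;
    }
    return false;
}

void toy_mutex_give(toy_mutex_t *m)
{
    if (m->holder != NULL) {
        /* Drop any inherited priority when the mutex is released. */
        m->holder->current_priority = m->holder->base_priority;
        m->holder = NULL;
    }
}
```

With a low-priority holder (1) and a high-priority waiter (3), the holder runs at 3 until it gives the mutex back, so a medium-priority task cannot starve it in the meantime.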

#Selection Criteria

Situation	Tool
Shared between ISR and task, ≤ 10 µs	Interrupt disable
Shared between tasks, > 100 µs of work	Mutex
Short work on SMP	Spinlock
Lock-free is possible	atomic API

#Atomic Operations

Instructions for which the CPU itself guarantees atomicity. They are short and fast.

#include <stdatomic.h>

atomic_int counter = 0;

void increment(void) {
    atomic_fetch_add(&counter, 1);   // atomic counter++
}

On ARMv7+ this is built from LDREX/STREX (Load-Exclusive, Store-Exclusive):

loop:
    LDREX  r0, [counter]
    ADD    r0, r0, #1
    STREX  r1, r0, [counter]
    CMP    r1, #0           ; did STREX succeed?
    BNE    loop             ; retry on failure

There is also a CAS (Compare-And-Swap) variant. These primitives are the foundation of lock-free data structures.
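The same retry shape appears in C when using compare-and-swap directly. As an illustration, the sketch below increments a counter only while it is below a limit, something `atomic_fetch_add` alone cannot express. The function name `increment_below` is made up for this example; the `stdatomic.h` calls are standard C11.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Increment *counter unless it has reached limit. Returns true on increment. */
bool increment_below(atomic_int *counter, int limit)
{
    int seen = atomic_load(counter);
    while (seen < limit) {
        /* Succeeds only if *counter still equals 'seen'. On failure, 'seen'
         * is refreshed with the current value and the loop retries. */
        if (atomic_compare_exchange_weak(counter, &seen, seen + 1)) {
            return true;
        }
    }
    return false;
}
```

The weak variant may fail spuriously, which mirrors a failed STREX; that is harmless here because the call already sits in a retry loop.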

#Memory Ordering: The Reordering Trap

ARM and RISC-V follow relaxed memory models, so both the compiler and the CPU may reorder memory accesses.

// Producer
data = 42;            // (1)
ready = 1;            // (2)

// Consumer
while (!ready);       // (3)
use(data);            // (4) no guarantee of seeing 42!

If the writes (1) and (2) become visible in the opposite order, (4) reads garbage. The fix is a memory barrier.

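data = 42;
__sync_synchronize();   // or atomic_thread_fence (C11) / std::atomic_thread_fence (C++)
ready = 1;

The consumer needs a matching barrier between (3) and (4) so that the read of data is not performed ahead of the read of ready. On ARM the producer-side barrier can be DMB ST (Data Memory Barrier, Store); x86 is strongly ordered by default, although the compiler can still reorder there.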
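In portable C11 the producer/consumer handoff is usually written with release and acquire operations on the flag instead of a standalone full barrier. The following is a minimal sketch under that assumption; `publish` and `try_consume` are hypothetical names, and the payload is a single `int` for brevity.

```c
#include <stdatomic.h>
#include <stdbool.h>

static int         data;    /* plain payload                              */
static atomic_bool ready;   /* publication flag, zero-initialized (false) */

void publish(int value)
{
    data = value;                                               /* (1) */
    /* Release: the write in (1) is visible before the flag reads as true. */
    atomic_store_explicit(&ready, true, memory_order_release);  /* (2) */
}

/* Returns true and fills *out once the producer has published. */
bool try_consume(int *out)
{
    /* Acquire: the read in (4) cannot be performed ahead of this load. */
    if (!atomic_load_explicit(&ready, memory_order_acquire)) {  /* (3) */
        return false;
    }
    *out = data;                                                /* (4) */
    return true;
}
```

A consumer would poll `try_consume(&v)` in place of the bare `while (!ready);` loop. The release store and acquire load together cover both the producer-side and the consumer-side ordering.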

#False Sharing: The Cache Trap

struct {
    int counter_a;        // written frequently by CPU 0
    int counter_b;        // written frequently by CPU 1
} shared;                  // both fields sit in the same 64-byte cache line

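When CPU 0 writes counter_a, the whole cache line is pulled into CPU 0's cache. When CPU 1 then writes counter_b, the line migrates to CPU 1. The line ping-pongs between cores even though the two fields are logically independent. The fix is padding.

#include <stdalign.h>

struct {
    alignas(64) int counter_a;   // starts its own 64-byte cache line
    alignas(64) int counter_b;   // starts the next cache line
} shared;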
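It is cheap to have the compiler confirm that the padding actually separates the fields. The sketch below assumes a 64-byte cache line (the figure used above; real targets vary) and introduces the hypothetical names `CACHE_LINE` and `struct padded_counters`.

```c
#include <assert.h>
#include <stdalign.h>
#include <stddef.h>

#define CACHE_LINE 64   /* assumed line size; check the target core's manual */

struct padded_counters {
    alignas(CACHE_LINE) int counter_a;
    alignas(CACHE_LINE) int counter_b;
};

/* Compile-time checks: each counter starts on its own cache line. */
static_assert(offsetof(struct padded_counters, counter_a) == 0,
              "counter_a starts the first line");
static_assert(offsetof(struct padded_counters, counter_b) == CACHE_LINE,
              "counter_b starts the next line");
static_assert(sizeof(struct padded_counters) == 2 * CACHE_LINE,
              "struct occupies two full lines");
```

The cost is memory: two `int` counters now occupy 128 bytes, so this layout is worth applying only to fields that are genuinely written hot from different cores.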

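#Critical Section Length: Recommendations

Task	Estimated time	Suitable tool
Update one variable	50 ns	atomic (or IRQ disable)
Update a struct (tens of bytes)	1 µs	IRQ disable or mutex
Copy 1 KB of data	10 µs	mutex
File I/O, network	ms	mutex (never IRQ disable)

As a rule of thumb, keep IRQ-disabled sections under about 50 µs. Beyond that, ISRs stay blocked long enough that interrupts can be lost; the real budget depends on the tightest interrupt deadline in the system.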

#Volatile ≠ Synchronization

volatile only stops the compiler from optimizing away or caching accesses. It provides neither atomicity nor memory ordering.

volatile int counter = 0;
counter++;  // still three instructions → race condition

Use volatile only for MMIO register access (hardware) and for simple single-byte flags such as one shared with an interrupt.

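#FreeRTOS API Summary

API	Effect
`taskENTER_CRITICAL()`	Masks interrupts (via BASEPRI on Cortex-M ports that have it), which also prevents context switches
`taskEXIT_CRITICAL()`	Restores
`taskENTER_CRITICAL_FROM_ISR()`	Critical section inside an ISR
`vTaskSuspendAll()`	Suspends the scheduler (IRQs stay enabled)
`xTaskResumeAll()`	Restores
`portDISABLE_INTERRUPTS()`	Masks interrupts without nesting tracking (how much is masked is port-specific)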

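#Common Mistakes

⚠️ Trying to fix a race condition with volatile

As explained above, you need the atomic API or a critical section.

⚠️ Long work inside a critical section

Putting a 100 ms job inside an interrupt-disabling critical section blocks every ISR and task for that whole time. Keep it short.

⚠️ Missing memory barrier

Under the relaxed models of ARM and RISC-V, write order is not guaranteed. Barriers are mandatory when sharing data across cores or with DMA.

⚠️ Mutex in an ISR

An ISR cannot block, so taking a mutex from an ISR is invalid and typically ends in an assert or a crash. From an ISR, only give (signal) a semaphore, using xSemaphoreGiveFromISR().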

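#Summary

A race condition arises when shared data, concurrent access, and missing synchronization come together.
The three protection tools are IRQ disable, spinlock (SMP), and mutex.
IRQ disable suits short work; a mutex suits long work.
The atomic API is the foundation of lock-free code.
volatile, atomics, and memory barriers are three separate concepts.

The next part covers semaphores: counting versus binary, and usage patterns.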

#Related

Practical RTOS Internals · 7 of 53

Tracing the Queue Implementation: Ring Buffer, 2 Wait Lists, Atomic Send/Receive

FreeRTOS queue code: pcWriteTo, pcReadFrom, uxMessagesWaiting, plus xTasksWaitingToSend/Receive.

May 6, 2026 · queue

Tracing the Mutex Implementation: Owner, Recursion Count, No ISR Use

Mutex = Semaphore + pxMutexHolder + uxBasePriority. The recursive variant adds a lock count.

May 6, 2026 · mutex

Comparing Critical Section Implementations: IRQ Disable, BASEPRI, Spinlock

Three implementations: cpsid/BASEPRI mask, taskENTER_CRITICAL, SMP spinlock. Hold time determines latency.

May 6, 2026 · critical-section
