Unsafe Rust and FFI §§ZH§§ unsafe Rust 与 FFI - Rust for Python Programmers

When and Why to Use Unsafe
什么时候该用 `unsafe`，为什么会需要它

What you’ll learn: What unsafe permits and why it exists, writing Python extensions with PyO3, Rust’s testing framework vs pytest, mocking with mockall, and benchmarking basics.
本章将学习： unsafe 允许做什么、它存在的原因、如何用 PyO3 编写 Python 扩展、Rust 测试框架与 pytest 的对应关系、如何用 mockall 做 mock，以及基准测试的基础思路。

Difficulty: 🔴 Advanced
难度： 🔴 高级

unsafe in Rust is an escape hatch. It tells the compiler: “This part cannot be fully verified automatically, but the invariants are still my responsibility.” Python has no direct equivalent because Python never hands over raw memory access in the same way.
Rust 里的 unsafe 本质上是一扇逃生门，意思是：“这一小段编译器没法完全替着验证，但相关不变量依然要由代码作者负责。”Python 没有完全对应的东西，因为它本来就很少把原始内存访问权直接交出来。

flowchart TB
    subgraph Safe ["Safe Rust (99% of code)<br/>安全 Rust"]
        S1["Your application logic<br/>业务逻辑"]
        S2["pub fn safe_api(&self) -> Result<br/>安全 API"]
    end
    subgraph Unsafe ["unsafe block (minimal, audited)<br/>最小化且可审计的 unsafe 块"]
        U1["Raw pointer dereference<br/>裸指针解引用"]
        U2["FFI call to C/Python<br/>调用 C 或 Python FFI"]
    end
    subgraph External ["External (C / Python / OS)<br/>外部系统"]
        E1["libc / PyO3 / system calls"]
    end
    S1 --> S2
    S2 --> U1
    S2 --> U2
    U1 --> E1
    U2 --> E1
    style Safe fill:#d4edda,stroke:#28a745
    style Unsafe fill:#fff3cd,stroke:#ffc107
    style External fill:#f8d7da,stroke:#dc3545

The pattern: a small unsafe block sits behind a safe API, and callers never have to touch unsafe themselves. Python’s ctypes world is much blurrier; every boundary call is effectively risky by default.
典型模式： 把一小段 unsafe 藏在安全 API 后面，调用方根本看不见 unsafe。相比之下，Python 的 ctypes 边界通常更模糊，每一次跨边界调用都默认带着风险。

📌 See also: Ch. 13 — Concurrency introduces Send and Sync, which are unsafe auto-traits checked by the compiler to keep threaded code sound.
📌 延伸阅读： 第 13 章——并发里提到的 Send 和 Sync，本质上也是和 unsafe 语义紧密相关的自动 trait，编译器会借它们保证并发代码的正确性。

What `unsafe` Allows
`unsafe` 允许做什么

// unsafe lets you do FIVE things that safe Rust forbids:
// 1. Dereference raw pointers
// 2. Call unsafe functions/methods
// 3. Access mutable static variables
// 4. Implement unsafe traits
// 5. Access union fields

// Example: calling a C function
extern "C" {
    fn abs(input: i32) -> i32;
}

fn main() {
    // SAFETY: abs() is a well-defined C standard library function.
    let result = unsafe { abs(-42) };  // Safe Rust can't verify C code
    println!("{result}");               // 42
}

When to Use `unsafe`
什么时候该用 `unsafe`

#![allow(unused)]
fn main() {
// 1. FFI — calling C libraries (most common reason)
// 2. Performance-critical inner loops (rare)
// 3. Data structures the borrow checker can't express (rare)

// As a Python developer, you'll mostly encounter unsafe in:
// - PyO3 internals (Python ↔ Rust bridge)
// - C library bindings
// - Low-level system calls

// Rule of thumb: if you're writing application code (not library code),
// you should almost never need unsafe. If you think you do, ask in the
// Rust community first — there's usually a safe alternative.
}

日常应用代码里，unsafe 的出场率应该低得可怜。要是三天两头就想往里加 unsafe，多半是设计先跑偏了，不是编译器太严格。
In day-to-day application code, unsafe should be rare. If it starts showing up everywhere, that is usually a sign that the design needs rethinking rather than that Rust is “being too strict.”

PyO3: Rust Extensions for Python
PyO3：给 Python 写 Rust 扩展

PyO3 is the main bridge between Python and Rust. It lets Rust functions and types appear as ordinary Python-callable modules, which is exactly what many Python developers need when speeding up hot paths.
PyO3 是 Python 和 Rust 之间最常用的桥。它能让 Rust 函数和类型直接变成 Python 可调用模块，这对想给 Python 热点逻辑提速的人来说非常实用。

Creating a Python Extension in Rust
用 Rust 创建 Python 扩展

# Setup
pip install maturin    # Build tool for Rust Python extensions
maturin init           # Creates project structure

# Project structure:
# my_extension/
# ├── Cargo.toml
# ├── pyproject.toml
# └── src/
#     └── lib.rs

# Cargo.toml
[package]
name = "my_extension"
version = "0.1.0"
edition = "2021"

[lib]
crate-type = ["cdylib"]    # Shared library for Python

[dependencies]
pyo3 = { version = "0.22", features = ["extension-module"] }

#![allow(unused)]
fn main() {
// src/lib.rs — Rust functions callable from Python
use pyo3::prelude::*;

/// A fast Fibonacci function written in Rust.
#[pyfunction]
fn fibonacci(n: u64) -> u64 {
    let (mut a, mut b) = (0u64, 1u64);
    for _ in 0..n {
        let temp = b;
        b = a.wrapping_add(b);
        a = temp;
    }
    a
}

/// Find all prime numbers up to n (Sieve of Eratosthenes).
#[pyfunction]
fn primes_up_to(n: usize) -> Vec<usize> {
    let mut is_prime = vec![true; n + 1];
    is_prime[0] = false;
    if n > 0 { is_prime[1] = false; }
    for i in 2..=((n as f64).sqrt() as usize) {
        if is_prime[i] {
            for j in (i * i..=n).step_by(i) {
                is_prime[j] = false;
            }
        }
    }
    (2..=n).filter(|&i| is_prime[i]).collect()
}

/// A Rust class usable from Python.
#[pyclass]
struct Counter {
    value: i64,
}

#[pymethods]
impl Counter {
    #[new]
    fn new(start: i64) -> Self {
        Counter { value: start }
    }

    fn increment(&mut self) {
        self.value += 1;
    }

    fn get_value(&self) -> i64 {
        self.value
    }

    fn __repr__(&self) -> String {
        format!("Counter(value={})", self.value)
    }
}

/// The Python module definition.
#[pymodule]
fn my_extension(m: &Bound<'_, PyModule>) -> PyResult<()> {
    m.add_function(wrap_pyfunction!(fibonacci, m)?)?;
    m.add_function(wrap_pyfunction!(primes_up_to, m)?)?;
    m.add_class::<Counter>()?;
    Ok(())
}
}

Using from Python
在 Python 里使用

# Build and install:
maturin develop --release   # Builds and installs into current venv

# Python — use the Rust extension like any Python module
import my_extension

# Call Rust function
result = my_extension.fibonacci(50)
print(result)  # 12586269025 — computed in microseconds

# Use Rust class
counter = my_extension.Counter(0)
counter.increment()
counter.increment()
print(counter.get_value())  # 2
print(counter)              # Counter(value=2)

# Performance comparison:
import time

# Python version
def py_primes(n):
    sieve = [True] * (n + 1)
    for i in range(2, int(n**0.5) + 1):
        if sieve[i]:
            for j in range(i*i, n+1, i):
                sieve[j] = False
    return [i for i in range(2, n+1) if sieve[i]]

start = time.perf_counter()
py_result = py_primes(10_000_000)
py_time = time.perf_counter() - start

start = time.perf_counter()
rs_result = my_extension.primes_up_to(10_000_000)
rs_time = time.perf_counter() - start

print(f"Python: {py_time:.3f}s")    # ~3.5s
print(f"Rust:   {rs_time:.3f}s")    # ~0.05s — 70x faster!
print(f"Same results: {py_result == rs_result}")  # True

PyO3 Quick Reference
PyO3 速查表

Python Concept	PyO3 Attribute	Notes 说明
Function	`#[pyfunction]`	Exposed to Python 暴露给 Python
Class	`#[pyclass]`	Python-visible class Python 可见类
Method	`#[pymethods]`	Methods on a pyclass 类方法集合
`__init__`	`#[new]`	Constructor 构造函数
`__repr__`	`fn __repr__()`	String representation 调试字符串表示
`__str__`	`fn __str__()`	Display string 显示字符串
`__len__`	`fn __len__()`	Length 长度
`__getitem__`	`fn __getitem__()`	Indexing 下标访问
Property	`#[getter]` / `#[setter]`	Attribute access 属性访问
Static method	`#[staticmethod]`	No self 无 `self`
Class method	`#[classmethod]`	Takes `cls` 接收类对象

FFI Safety Patterns
FFI 安全模式

When exposing Rust to Python or C, these rules avoid the most common disasters.
当 Rust 要暴露给 Python 或 C 使用时，下面这些规则能挡住最常见的爆炸现场。

Never let a panic cross the FFI boundary. PyO3 handles this for #[pyfunction], but raw extern "C" functions must catch panics manually.

#![allow(unused)]
fn main() {
#[no_mangle]
pub extern "C" fn raw_ffi_function() -> i32 {
    match std::panic::catch_unwind(|| {
        // actual logic
        42
    }) {
        Ok(result) => result,
        Err(_) => -1,  // Return error code instead of panicking into C/Python
    }
}
}

1. 绝对别让 panic 穿过 FFI 边界。 #[pyfunction] 这类接口 PyO3 会帮着兜住，但裸写 extern "C" 时，必须手动 catch_unwind。

Use #[repr(C)] for shared structs when another language reads the fields directly. Otherwise the内存布局就没有稳定保证。

2. 跨语言共享字段布局时，一定加 #[repr(C)]。 如果对方语言要直接按字段读结构体，没这个注解就别指望布局稳定。

Use extern "C" for raw FFI functions so the calling convention matches. PyO3 hides this for Python-facing functions.

3. 原始 FFI 函数要用 extern "C"。 这样调用约定才一致。PyO3 对面向 Python 的函数会把这些底层细节包起来。

PyO3 advantage: PyO3 automatically handles panic conversion, type marshalling, and much of the GIL interaction. Unless there is a very specific low-level requirement, it is far better than hand-rolled raw FFI.
PyO3 的优势： 它会替着处理 panic 转换、类型转换和大量 GIL 相关细节。除非真有非常底层的特殊要求，否则一般都比手搓裸 FFI 舒服得多。

Unit Tests vs pytest
单元测试与 pytest 对照

Python Testing with pytest
Python 里用 pytest

# test_calculator.py
import pytest
from calculator import add, divide

def test_add():
    assert add(2, 3) == 5

def test_add_negative():
    assert add(-1, 1) == 0

def test_divide():
    assert divide(10, 2) == 5.0

def test_divide_by_zero():
    with pytest.raises(ZeroDivisionError):
        divide(1, 0)

# Parameterized tests
@pytest.mark.parametrize("a,b,expected", [
    (1, 2, 3),
    (0, 0, 0),
    (-1, -1, -2),
    (100, 200, 300),
])
def test_add_parametrized(a, b, expected):
    assert add(a, b) == expected

# Fixtures
@pytest.fixture
def sample_data():
    return [1, 2, 3, 4, 5]

def test_sum(sample_data):
    assert sum(sample_data) == 15

# Running tests
pytest                      # Run all tests
pytest test_calculator.py   # Run one file
pytest -k "test_add"        # Run matching tests
pytest -v                   # Verbose output
pytest --tb=short           # Short tracebacks

Rust Built-in Testing
Rust 内建测试框架

#![allow(unused)]
fn main() {
// src/calculator.rs — tests live in the SAME file!
fn add(a: i32, b: i32) -> i32 {
    a + b
}

fn divide(a: f64, b: f64) -> Result<f64, String> {
    if b == 0.0 {
        Err("Division by zero".to_string())
    } else {
        Ok(a / b)
    }
}

// Tests go in a #[cfg(test)] module — only compiled during `cargo test`
#[cfg(test)]
mod tests {
    use super::*;  // Import everything from parent module

    #[test]
    fn test_add() {
        assert_eq!(add(2, 3), 5);
    }

    #[test]
    fn test_add_negative() {
        assert_eq!(add(-1, 1), 0);
    }

    #[test]
    fn test_divide() {
        assert_eq!(divide(10.0, 2.0), Ok(5.0));
    }

    #[test]
    fn test_divide_by_zero() {
        assert!(divide(1.0, 0.0).is_err());
    }

    // Test that something panics (like pytest.raises)
    #[test]
    #[should_panic(expected = "out of bounds")]
    fn test_out_of_bounds() {
        let v = vec![1, 2, 3];
        let _ = v[99];  // Panics
    }
}
}

# Running tests
cargo test                         # Run all tests
cargo test test_add                # Run matching tests
cargo test -- --nocapture          # Show println! output
cargo test -p my_crate             # Test one crate in workspace
cargo test -- --test-threads=1     # Sequential (for tests with side effects)

Testing Quick Reference
测试速查表

pytest	Rust	Notes 说明
`assert x == y`	`assert_eq!(x, y)`	Equality 相等断言
`assert x != y`	`assert_ne!(x, y)`	Inequality 不相等断言
`assert condition`	`assert!(condition)`	Boolean check 布尔条件断言
`assert condition, "msg"`	`assert!(condition, "msg")`	With message 带消息
`pytest.raises(E)`	`#[should_panic]`	Expect panic 预期 panic
`@pytest.fixture`	Helper setup code	No built-in fixture system 标准库没有完全对等的 fixture 机制
`@pytest.mark.parametrize`	`rstest` crate	Parameterized tests 参数化测试
`conftest.py`	`tests/common/mod.rs`	Shared helpers 共享测试辅助
`pytest.skip()`	`#[ignore]`	Skip a test 跳过测试
`tmp_path` fixture	`tempfile` crate	Temporary directories 临时目录

Parameterized Tests with rstest
用 `rstest` 写参数化测试

#![allow(unused)]
fn main() {
// Cargo.toml: rstest = "0.23"

use rstest::rstest;

// Like @pytest.mark.parametrize
#[rstest]
#[case(1, 2, 3)]
#[case(0, 0, 0)]
#[case(-1, -1, -2)]
#[case(100, 200, 300)]
fn test_add(#[case] a: i32, #[case] b: i32, #[case] expected: i32) {
    assert_eq!(add(a, b), expected);
}

// Like @pytest.fixture
use rstest::fixture;

#[fixture]
fn sample_data() -> Vec<i32> {
    vec![1, 2, 3, 4, 5]
}

#[rstest]
fn test_sum(sample_data: Vec<i32>) {
    assert_eq!(sample_data.iter().sum::<i32>(), 15);
}
}

Mocking with mockall
用 mockall 做 mock

# Python — mocking with unittest.mock
from unittest.mock import Mock, patch

def test_fetch_user():
    mock_db = Mock()
    mock_db.get_user.return_value = {"name": "Alice"}

    result = fetch_user_name(mock_db, 1)
    assert result == "Alice"
    mock_db.get_user.assert_called_once_with(1)

#![allow(unused)]
fn main() {
// Rust — mocking with mockall crate
// Cargo.toml: mockall = "0.13"

use mockall::{automock, predicate::*};

#[automock]                          // Generates MockDatabase automatically
trait Database {
    fn get_user(&self, id: i64) -> Option<User>;
}

fn fetch_user_name(db: &dyn Database, id: i64) -> Option<String> {
    db.get_user(id).map(|u| u.name)
}

#[test]
fn test_fetch_user() {
    let mut mock = MockDatabase::new();
    mock.expect_get_user()
        .with(eq(1))                   // assert_called_with(1)
        .times(1)                      // assert_called_once
        .returning(|_| Some(User { name: "Alice".into() }));

    let result = fetch_user_name(&mock, 1);
    assert_eq!(result, Some("Alice".to_string()));
}
}

mockall 和 Python 的 unittest.mock 不同，它不是到处临时打补丁，而是鼓励先把依赖抽象成 trait，再对 trait 生成 mock。麻烦一点，但结构更健康。
mockall works differently from Python’s unittest.mock. Instead of patching symbols all over the place, it encourages modeling dependencies as traits and generating mocks from those interfaces, which is more explicit and usually healthier for the design.

Exercises
练习

🏋️ Exercise: Safe Wrapper Around Unsafe
🏋️ 练习：给 `unsafe` 套一层安全包装

Challenge: Write a safe function split_at_mid that takes a &mut [i32] and returns two mutable slices split at the midpoint. Internally, use raw pointers and unsafe, similar to how split_at_mut works, but present a safe API to callers.
挑战：写一个安全函数 split_at_mid，接收 &mut [i32] 并在中点拆成两个可变切片返回。内部允许使用裸指针和 unsafe，模仿 split_at_mut 的实现思路，但对调用方暴露的接口必须是安全的。

🔑 Solution
🔑 参考答案

fn split_at_mid(slice: &mut [i32]) -> (&mut [i32], &mut [i32]) {
    let mid = slice.len() / 2;
    let ptr = slice.as_mut_ptr();
    let len = slice.len();

    assert!(mid <= len); // Safety check before unsafe

    unsafe {
        // SAFETY: mid <= len (asserted above), and ptr comes from a valid &mut slice,
        // so both sub-slices are within bounds and non-overlapping.
        (
            std::slice::from_raw_parts_mut(ptr, mid),
            std::slice::from_raw_parts_mut(ptr.add(mid), len - mid),
        )
    }
}

fn main() {
    let mut data = vec![1, 2, 3, 4, 5, 6];
    let (left, right) = split_at_mid(&mut data);
    left[0] = 99;
    right[0] = 88;
    println!("left: {left:?}, right: {right:?}");
    // left: [99, 2, 3], right: [88, 5, 6]
}

Key takeaway: Keep the unsafe block tiny, guard it with explicit checks, and expose only a safe interface outward. That is the standard Rust pattern: unsafe internals, safe public API.
核心收获： unsafe 块要尽量小，前面要有明确校验，对外只暴露安全接口。这就是 Rust 处理不安全代码的标准套路：内部不安全，外部安全。

Keyboard shortcuts

Rust for Python Programmers | Rust 面向 Python 程序员