Robotics Math Workbook — Engineermaxxing

Chapter 0: 2D Rotation Matrices

You're programming a robot arm and need to figure out where the tip of the arm ends up after a joint rotates. Everything starts with this: how do you rotate a point in 2D?

A 2D rotation matrix R(θ) rotates a point counter-clockwise by angle θ around the origin:

R(θ) =
[ cosθ -sinθ ]
[ sinθ    cosθ ]

To rotate point p = (x, y):
p' = R(θ) · p

Key properties:
det(R) = cos²θ + sin²θ = 1    // preserves area
R(θ)^-1 = R(θ)^T = R(-θ)    // orthogonal matrix
R(α) · R(β) = R(α + β)    // rotations compose by addition

The rotation matrix is orthogonal. Its inverse equals its transpose. This means "undo a rotation" is free — just transpose the matrix. No matrix inversion needed. Every rotation matrix has determinant +1 and belongs to the Special Orthogonal group SO(2).

Exercise 0.1: Rotate a Point by 90° Derive

Rotate the point (3, 0) counter-clockwise by 90°. What is the y-coordinate of the result?

R(90°) = [[cos90, -sin90], [sin90, cos90]] = [[0, -1], [1, 0]]. Multiply R · [3, 0]^T.

y-coordinate

Show derivation

R(90°) · [3, 0]^T = [[0, -1], [1, 0]] · [3, 0]^T

x' = 0×3 + (-1)×0 = 0

y' = 1×3 + 0×0 = 3

The point (3, 0) on the positive x-axis moves to (0, 3) on the positive y-axis — exactly a quarter turn counter-clockwise.

Exercise 0.2: Compose Two Rotations Derive

Compute the (1,1) entry (top-left) of R(30°) × R(60°). Express as a decimal.

Use R(α)R(β) = R(α+β). The (1,1) entry of a rotation matrix is cos of the total angle.

(1,1) entry

Show derivation

R(30°) × R(60°) = R(30° + 60°) = R(90°)

(1,1) entry = cos(90°) = 0

Rotation composition is just angle addition. This is why rotations in 2D are so simple — they commute (R(a)R(b) = R(b)R(a)). In 3D, rotations do not commute, which is what makes 3D orientation so much harder.

Exercise 0.3: Inverse = Transpose Trace

You rotated a point by R(45°) and need to undo the rotation. Which operation recovers the original point?

Multiply by R(45°) again Multiply by R(45°)^T = R(-45°) Subtract 45° from each component Compute the full matrix inverse via cofactors

Show explanation

Because R is orthogonal, R^-1 = R^T. The transpose of R(θ) is R(-θ) — rotating by the negative angle. No need for expensive matrix inversion. This property holds for ALL rotation matrices in ANY dimension.

Exercise 0.4: Rotate (1, 1) by 45° Derive

Rotate the point (1, 1) counter-clockwise by 45°. What is the x-coordinate of the result? (cos45° = sin45° = √2/2 ≈ 0.7071)

x-coordinate

Show derivation

R(45°) = [[0.7071, -0.7071], [0.7071, 0.7071]]

x' = 0.7071 × 1 + (-0.7071) × 1 = 0

y' = 0.7071 × 1 + 0.7071 × 1 = 1.4142 = √2

The point (1,1) is at 45° from the x-axis at distance √2. Rotating it 45° more places it at 90° from the x-axis — directly on the y-axis at (0, √2). The distance from the origin is preserved: √(0² + (√2)²) = √2 = √(1² + 1²).

Exercise 0.5: Implement rotate2D() Build

Write a function that rotates a 2D point by an angle (in radians). Return [x', y'].

Return an array [x', y']. Use Math.cos() and Math.sin().

Show solution

javascript
function rotate2D(x, y, theta) {
  const c = Math.cos(theta);
  const s = Math.sin(theta);
  return [c * x - s * y, s * x + c * y];
}

Exercise 0.6: Rotation Determinant Derive

What is the determinant of R(37°)? (No calculator needed.)

det(R)

Show derivation

det(R(θ)) = cos²θ + sin²θ = 1

This is true for ALL rotation matrices, regardless of angle. det = 1 means the rotation preserves area and orientation (no reflection). This is the defining property of SO(2) — Special Orthogonal group in 2D.

Chapter 1: Homogeneous Transforms

A robot joint rotates AND translates. Rotation matrices alone can't represent translation. How do you combine both into a single matrix multiply? The answer is homogeneous coordinates.

2D Homogeneous Transform (3×3):
T = [ R t ] = [ cosθ -sinθ t_x ]
[ 0 1 ] [ sinθ cosθ t_y ]
[ 0 0 1 ]

Transform a point: append 1 to make it homogeneous
[x', y', 1]^T = T · [x, y, 1]^T

Chain transforms:
T_{world_point} = T_{world_robot} · T_{robot_camera} · T_{camera_point}

Read right to left. T_{world_robot} · T_{robot_camera} · p_camera means: start in camera frame, transform to robot frame, then transform to world frame. Each T converts FROM the right subscript TO the left subscript.

Exercise 1.1: Pure Translation Derive

A robot is at position (3, 4) with no rotation (θ=0). Transform the point (1, 0) from robot frame to world frame. What is the x-coordinate in world frame?

x (world)

Show derivation

T_{world_robot} = [[1, 0, 3], [0, 1, 4], [0, 0, 1]] // θ=0, t=(3,4)

[x', y', 1]^T = T · [1, 0, 1]^T

x' = 1×1 + 0×0 + 3×1 = 4

y' = 0×1 + 1×0 + 4×1 = 4

With zero rotation, the transform is just a translation: add the robot position to the point. (1,0) in robot frame becomes (4,4) in world frame.

Exercise 1.2: Rotation + Translation Derive

Robot at position (2, 1), heading θ=90°. A point at (1, 0) in the robot's frame. What is the y-coordinate in world frame?

Build T with R(90°) and t=(2,1). Multiply T · [1, 0, 1]^T.

y (world)

Show derivation

T = [[cos90, -sin90, 2], [sin90, cos90, 1], [0, 0, 1]]

T = [[0, -1, 2], [1, 0, 1], [0, 0, 1]]

x' = 0×1 + (-1)×0 + 2 = 2

y' = 1×1 + 0×0 + 1 = 2

The robot faces 90° (along the y-axis). A point 1 unit "ahead" in robot frame (robot's x-axis) is actually 1 unit along the world's y-axis. So world position = (2, 1+1) = (2, 2).

Exercise 1.3: Chaining Transforms Derive

A robot arm has: T_{world_base} (translate by (0, 1), no rotation) and T_{base_tool} (translate by (2, 0), rotate 90°). A point at (1, 0) in the tool frame. What is the x-coordinate in the world frame?

Compute T_{world_base} · T_{base_tool} · [1, 0, 1]^T. Work inside out: first transform from tool to base, then base to world.

x (world)

Show derivation

T_{base_tool} = [[0, -1, 2], [1, 0, 0], [0, 0, 1]] // rot 90°, trans (2,0)

p_base = T_{base_tool} · [1, 0, 1]^T = [0×1 + (-1)×0 + 2, 1×1 + 0×0 + 0, 1] = [2, 1, 1]

T_{world_base} = [[1, 0, 0], [0, 1, 1], [0, 0, 1]] // no rot, trans (0,1)

p_world = T_{world_base} · [2, 1, 1]^T = [2, 2, 1]

x_world = 2. The chain works inside out: point (1,0) in tool frame becomes (2,1) in base frame (the tool frame is rotated 90° and offset by 2), then (2,2) in world frame (base is offset by 1 in y).

Exercise 1.4: Transform Inverse Trace

You have T_{world_robot} and a point in world frame. To get the point in robot frame, you need:

Multiply by T_{world_robot} again Subtract the translation Multiply by T_{world_robot}^-1 = [[R^T, -R^Tt], [0, 1]] Transpose the entire T matrix

Show explanation

The inverse of a homogeneous transform is NOT just the transpose (that only works for pure rotation matrices). The inverse is T^-1 = [[R^T, -R^Tt], [0, 1]]. You rotate the translation vector by R^T and negate it. This is computationally cheap — no general matrix inversion needed.

Exercise 1.5: Transform Pipeline Design

A self-driving car sees a pedestrian. Arrange the steps to convert the pedestrian's position from LiDAR frame to GPS (world) frame.

→

Point in LiDAR frame T_{car_lidar} T_{world_car} Point in world frame

Show explanation

The pipeline reads left to right but the math reads right to left: p_world = T_{world_car} · T_{car_lidar} · p_lidar. Start with the point, apply the LiDAR-to-car transform, then the car-to-world transform.

Chapter 2: 3D Rotations

In 2D, rotations were simple — one angle, one axis. In 3D, everything gets harder. There are multiple representations (Euler angles, rotation matrices, quaternions, axis-angle), each with tradeoffs. And there's a trap called gimbal lock.

Elementary rotation matrices:
R_x(α) = [[1, 0, 0], [0, cosα, -sinα], [0, sinα, cosα]]
R_y(β) = [[cosβ, 0, sinβ], [0, 1, 0], [-sinβ, 0, cosβ]]
R_z(γ) = [[cosγ, -sinγ, 0], [sinγ, cosγ, 0], [0, 0, 1]]

Euler angles (ZYX convention): R = R_z(γ) · R_y(β) · R_x(α)

Quaternion: q = [w, x, y, z], ||q|| = 1
From axis-angle (axis n, angle θ): q = [cos(θ/2), sin(θ/2)·n_x, sin(θ/2)·n_y, sin(θ/2)·n_z]

Gimbal lock happens with Euler angles when the pitch (middle rotation) reaches ±90°. At that point, the first and third rotation axes align, and you lose one degree of freedom. The system can no longer represent arbitrary rotations. This is why aerospace, robotics, and game engines use quaternions instead.

Exercise 2.1: Rz(90°) Derive

A drone rotates 90° around the z-axis (yaw). What does R_z(90°) · [1, 0, 0]^T give? Enter the y-component.

y-component

Show derivation

R_z(90°) = [[0, -1, 0], [1, 0, 0], [0, 0, 1]]

R_z · [1, 0, 0]^T = [0, 1, 0]^T

The x-axis unit vector rotates to the y-axis unit vector. A 90° yaw maps "forward" to "left" (in a right-handed frame). The z-component stays 0 because R_z doesn't affect z.

Exercise 2.2: Gimbal Lock Trace

When using ZYX Euler angles, at what pitch angle β does gimbal lock occur?

β = 0° β = ±90° — the yaw and roll axes align β = 180° β = 45°

Show explanation

At β = 90°, cosβ = 0. Plug into R_z(γ)R_y(90°)R_x(α) and you get a matrix that only depends on γ+α, not on γ and α independently. Two parameters collapsed into one — you lost a degree of freedom. This is gimbal lock. The fix: use quaternions, which have no singular configurations.

Exercise 2.3: Quaternion for 90° about Z Derive

Convert the rotation "90° about the z-axis" to a unit quaternion q = [w, x, y, z]. What is the w component?

Axis = [0, 0, 1], angle = 90° = π/2. Use q = [cos(θ/2), sin(θ/2)·n].

w component

Show derivation

w = cos(θ/2) = cos(45°) = √2/2 ≈ 0.7071

x = sin(45°) × 0 = 0

y = sin(45°) × 0 = 0

z = sin(45°) × 1 = 0.7071

q = [0.7071, 0, 0, 0.7071]

Notice the half-angle: 90° rotation uses 45° in the quaternion. This is because quaternions double-cover SO(3) — both q and -q represent the same rotation. The norm is ||q|| = √(0.5 + 0 + 0 + 0.5) = 1, confirming it's a valid unit quaternion.

Exercise 2.4: Quaternion for 180° about X Derive

Convert "180° about the x-axis" to a quaternion. What is the z component?

z component

Show derivation

w = cos(90°) = 0

x = sin(90°) × 1 = 1

y = sin(90°) × 0 = 0

z = sin(90°) × 0 = 0

q = [0, 1, 0, 0]

z = 0 because the rotation axis is [1,0,0], which has no z component. A 180° rotation has w=0 (cos(90°)=0), meaning the quaternion is purely imaginary. This represents the "most different" orientation from identity (where q = [1,0,0,0]).

Exercise 2.5: Quaternion Norm Derive

Is q = [0.5, 0.5, 0.5, 0.5] a valid unit quaternion? Compute ||q||.

||q||

Show derivation

||q|| = √(0.5² + 0.5² + 0.5² + 0.5²) = √(0.25 × 4) = √1 = 1

Yes, it is a valid unit quaternion. It represents a 120° rotation about the axis [1,1,1]/√3. You can verify: w = cos(θ/2) = 0.5 implies θ/2 = 60°, so θ = 120°. The axis is [0.5, 0.5, 0.5]/sin(60°) = [0.5, 0.5, 0.5]/0.866 = [0.577, 0.577, 0.577] ≈ [1,1,1]/√3.

Chapter 3: EKF Prediction Step

Your robot drives forward and turns. You know the velocity and angular rate from the wheels. But the model is nonlinear — position update involves cos(θ) and sin(θ). The Extended Kalman Filter (EKF) handles this by linearizing around the current estimate.

State: x = [x, y, θ]^T    // robot pose in 2D

Motion model (velocity model):
x' = x + v · cos(θ) · dt
y' = y + v · sin(θ) · dt
θ' = θ + ω · dt

Jacobian F = ∂f/∂x:
F = [ 1 0 -v·sin(θ)·dt ]
    [ 0 1    v·cos(θ)·dt ]
    [ 0 0                1 ]

Covariance prediction: P^- = F · P · F^T + Q

Why the Jacobian? The Kalman filter needs a linear model (matrix multiply). The EKF takes the nonlinear model f(x, u) and computes its first-order Taylor expansion at the current state. The Jacobian F is that linear approximation. It tells you: "if the state changes by a small δx, the output changes by F · δx."

Exercise 3.1: Predict New Position Derive

Robot at x=[0, 0, 0] (origin, facing east). Drives with v=1 m/s, ω=0 rad/s, dt=1s. What is the new x-coordinate?

x (meters)

Show derivation

x' = 0 + 1 × cos(0) × 1 = 1

y' = 0 + 1 × sin(0) × 1 = 0

θ' = 0 + 0 × 1 = 0

Facing east (θ=0), cos(0)=1, sin(0)=0. The robot drives 1 meter along the x-axis. Simple — but the math gets interesting when θ ≠ 0.

Exercise 3.2: Predict with Rotation Derive

Robot at x=[0, 0, π/4] (facing 45°). Drives with v=2 m/s, ω=0 rad/s, dt=0.5s. What is the new y-coordinate?

y (meters)

Show derivation

y' = 0 + 2 × sin(π/4) × 0.5 = 1 × sin(45°) = 0.7071

x' = 0 + 2 × cos(π/4) × 0.5 = 0.7071

At 45°, the robot moves equally in x and y. It travels a distance of v×dt = 1 meter, split as (cos45°, sin45°) = (0.707, 0.707).

Exercise 3.3: Compute the Jacobian F Derive

Robot at θ=π/2 (facing north), v=1 m/s, dt=0.5s. What is the (1,3) entry of the Jacobian F? (Row 1, column 3.)

F(1,3) = -v · sin(θ) · dt.

F(1,3)

Show derivation

F(1,3) = -v · sin(θ) · dt = -1 × sin(π/2) × 0.5 = -1 × 1 × 0.5 = -0.5

This entry means: "if heading changes by a small δθ, the x-position prediction changes by -0.5 × δθ." When facing north (θ=π/2), a small increase in heading rotates the velocity vector slightly to the left, reducing x — hence the negative sign.

Exercise 3.4: Covariance Prediction P^- Derive

Given P = [[1,0,0],[0,1,0],[0,0,0.1]], Q = [[0.01,0,0],[0,0.01,0],[0,0,0.001]], and θ=0, v=1, dt=1. Compute P^- = FPF^T + Q. What is the (1,1) entry of P^-?

At θ=0: F = [[1,0,0],[0,1,1],[0,0,1]] (since -v·sin(0)·dt=0 and v·cos(0)·dt=1).

P^-(1,1)

Show derivation

F = [[1, 0, 0], [0, 1, 1], [0, 0, 1]]

FP = [[1,0,0],[0,1,0.1],[0,0,0.1]]

FPF^T = [[1,0,0],[0,1.1,0.1],[0,0.1,0.1]]

P^- = FPF^T + Q = [[1.01,0,0],[0,1.11,0.1],[0,0.1,0.101]]

P^-(1,1) = 1 + 0.01 = 1.01. The x uncertainty barely grew because at θ=0, x-position doesn't depend on θ (F(1,3)=0). But look at P^-(2,2) = 1.11: the y uncertainty grew more because F(2,3)=1, meaning y-prediction couples with θ-uncertainty.

Exercise 3.5: Find the Bug Debug

This EKF prediction has a bug. The robot always drifts right even when driving straight north. Click the buggy line.

function ekfPredict(x, y, theta, v, omega, dt) {
  const xNew = x + v * Math.cos(theta) * dt;
  const yNew = y + v * Math.cos(theta) * dt;
  const thetaNew = theta + omega * dt;
  return [xNew, yNew, thetaNew];
}

Show explanation

Line 3 is the bug. The y update uses Math.cos(theta) instead of Math.sin(theta). When θ=π/2 (facing north), cos(π/2)=0 so y doesn't change — the robot "drifts right" because x increases (cos is nonzero at other angles) while y stays stuck. The correct line: const yNew = y + v * Math.sin(theta) * dt;

Exercise 3.6: Jacobian Structure Trace

In the Jacobian F of the velocity motion model, why are the (1,1), (2,2), and (3,3) diagonal entries all equal to 1?

Because rotations preserve length Because we normalize the state Because ∂x'/∂x = 1 — the new state includes the old state plus a correction. The identity reflects this additive structure. Because the matrix must be invertible

Show explanation

The motion model is x' = x + Δx, y' = y + Δy, θ' = θ + Δθ. Taking ∂x'/∂x = 1 because x appears directly in x'. The Jacobian has the form F = I + (off-diagonal terms from the nonlinear corrections). This is the hallmark of an additive motion model.

Chapter 4: EKF Update Step

The predict step pushed our estimate forward in time. Now a sensor reading arrives — say, the range and bearing to a known landmark. The update step fuses this measurement with the prediction.

Measurement model (range + bearing to landmark at (l_x, l_y)):
h(x) = [ √((l_x-x)² + (l_y-y)²) ]    // range r
        [ atan2(l_y-y, l_x-x) - θ ]    // bearing φ

Jacobian H = ∂h/∂x at robot (x, y, θ), landmark (l_x, l_y):
Let d_x = l_x-x, d_y = l_y-y, q = d_x² + d_y²
H = [ -d_x/√q -d_y/√q     0 ]
    [ d_y/q    -d_x/q    -1 ]

Update equations:
Innovation: y = z - h(x^-)
S = H P^- H^T + R
K = P^- H^T S^-1
x⁺ = x^- + K · y
P⁺ = (I - K H) P^-

K is a trust slider. The Kalman gain K balances two sources of uncertainty: the prediction (P^-) and the measurement (R). If the measurement is very precise (R small), K is large and the update pulls strongly toward the measurement. If the prediction is very precise (P^- small), K is small and the measurement barely matters.

Exercise 4.1: Predicted Measurement Derive

Robot at (0, 0, 0), landmark at (3, 4). What is the predicted range h₁(x)?

meters

Show derivation

r = √((3-0)² + (4-0)²) = √(9 + 16) = √25 = 5

The classic 3-4-5 right triangle. The predicted range is simply the Euclidean distance from robot to landmark.

Exercise 4.2: Predicted Bearing Derive

Same setup: robot at (0, 0, 0), landmark at (3, 4). What is the predicted bearing in degrees?

φ = atan2(l_y-y, l_x-x) - θ = atan2(4, 3) - 0.

degrees

Show derivation

φ = atan2(4, 3) = 0.9273 rad = 53.13°

The landmark is at 53.13° from the robot's heading direction. Since the robot faces east (θ=0), this is the same as the absolute angle to the landmark.

Exercise 4.3: Compute H(1,1) Derive

Robot at (0,0,0), landmark at (3,4). Compute H(1,1), the partial derivative of range with respect to x.

H(1,1) = -d_x/√q where d_x=3, q=25.

H(1,1)

Show derivation

d_x = 3 - 0 = 3, d_y = 4 - 0 = 4, q = 9 + 16 = 25

H(1,1) = -d_x/√q = -3/5 = -0.6

This is -cos(φ) where φ is the bearing angle. The negative sign means: if the robot moves in +x direction, the range to the landmark decreases (because it's moving toward the landmark's x-component). The magnitude 0.6 = cos(53.13°) reflects that the landmark isn't directly ahead.

Exercise 4.4: Innovation Derive

Predicted measurement: h(x) = [5, 0.9273]. Actual measurement: z = [4.8, 0.95]. Compute the range innovation (first component of y = z - h(x)).

meters

Show derivation

y_range = z_range - h_range = 4.8 - 5.0 = -0.2

The measured range is 0.2m shorter than predicted. This means the landmark is closer than we thought, so the update step will pull the robot estimate toward the landmark. The innovation is the "surprise" — what the sensor told us that we didn't already know.

Exercise 4.5: Implement ekfUpdate() Build

Implement a simplified 1D EKF update. Given scalar state x, prediction covariance P, measurement z, measurement function h(x)=x, and measurement noise R, compute the updated state.

K = P/(P+R). x+ = x + K*(z-x). P+ = (1-K)*P.

Show solution

javascript
function ekfUpdate(x, P, z, R) {
  const K = P / (P + R);      // Kalman gain
  const xNew = x + K * (z - x); // updated state
  const PNew = (1 - K) * P;    // updated covariance
  return [xNew, PNew];
}

Exercise 4.6: Kalman Gain Extremes Trace

If the measurement noise R approaches zero (perfect sensor), what does the Kalman gain K = PH^T(HPH^T+R)^-1 approach in the scalar case (H=1)?

K → 0 (ignore measurement) K → 1 (trust measurement completely) K → 0.5 (equal trust) K → ∞

Show explanation

K = P/(P+R). As R → 0, K → P/P = 1. The updated state becomes x⁺ = x^- + 1 × (z - x^-) = z. A perfect sensor overrides the prediction entirely. Conversely, as R → ∞, K → 0 and the measurement is ignored. The Kalman gain is exactly the trust slider between prediction and measurement.

Chapter 5: SLAM Graph

SLAM (Simultaneous Localization and Mapping) is harder than EKF alone. You don't just filter — you build a graph of all past poses and optimize them jointly. Each node is a robot pose. Each edge is a constraint (odometry measurement or loop closure). The solution minimizes total error across all constraints.

Pose graph:
Nodes: x₁, x₂, ..., x_n each x_i = [x, y, θ]
Edges: z_ij = observed transform from pose i to pose j

Edge error:
e_ij = z_ij ⊖ (x_j ⊖ x_i)
= z_ij - T(x_i)^-1 · T(x_j)

Information matrix: Ω_ij = Σ_ij^-1

Total cost:
F(x) = ∑_ij e_ij^T Ω_ij e_ij

Optimize: x* = argmin F(x) via Gauss-Newton or Levenberg-Marquardt

Why graph optimization beats filtering. An EKF processes measurements one at a time and never looks back. A graph optimizer considers ALL measurements simultaneously, including loop closures that connect distant poses. When you recognize a place you visited 100 steps ago, the loop closure corrects not just the current pose but all intermediate poses. This is why graph SLAM produces consistent maps.

Exercise 5.1: Graph Variables Derive

A 2D pose graph has 100 robot poses, each with state [x, y, θ]. How many total optimization variables?

variables

Show derivation

100 poses × 3 DOF per pose = 300 variables

In 3D SLAM, each pose has 6 DOF (x, y, z, roll, pitch, yaw), so 100 poses = 600 variables. If landmarks are also optimized (as in full SLAM), add 2 variables per 2D landmark or 3 per 3D landmark.

Exercise 5.2: Odometry Error Derive

Pose x₁ = [0, 0, 0]. Odometry says we moved [1, 0, 0] (1m forward). Actual pose x₂ = [1.1, 0.05, 0.02]. What is the x-component of the error e = z₁₂ - (x₂ - x₁)?

meters

Show derivation

z₁₂ = [1, 0, 0] // odometry measurement

x₂ - x₁ = [1.1-0, 0.05-0, 0.02-0] = [1.1, 0.05, 0.02]

e_x = 1 - 1.1 = -0.1

The error is negative: the actual displacement was 0.1m more than the odometry predicted. The optimizer will try to reduce this error by adjusting x₁ and/or x₂. Weighted by Ω, larger errors in more-certain measurements get penalized more heavily.

Exercise 5.3: Information Matrix Derive

An odometry sensor has covariance Σ = [[0.01, 0, 0], [0, 0.01, 0], [0, 0, 0.001]]. What is the (1,1) entry of the information matrix Ω = Σ^-1?

Ω(1,1)

Show derivation

Since Σ is diagonal, Ω = Σ^-1 is also diagonal.

Ω(1,1) = 1/Σ(1,1) = 1/0.01 = 100

Higher information = more certainty. The x-position has Ω=100, while θ has Ω=1/0.001=1000. The angular measurement is 10x more informative, so angular errors are penalized 10x more heavily in the optimization.

Exercise 5.4: Loop Closure Impact Trace

A robot drives in a loop and recognizes the starting location after 50 poses. Adding this loop closure edge connects poses x₁ and x₅₀. What happens during optimization?

Only x₅₀ is corrected to match x₁ Only x₁ is corrected All poses x₁ through x₅₀ shift to distribute the accumulated drift evenly All poses are deleted and re-estimated from scratch

Show explanation

The graph optimizer adjusts ALL poses simultaneously. A loop closure reveals that drift has accumulated over 50 steps. Rather than slamming x₅₀ onto x₁ (which would violate all the odometry constraints), the optimizer spreads the correction across all intermediate poses. Each pose shifts a little, and the total cost is minimized. This is the power of graph-based SLAM over filtering.

Exercise 5.5: System Size Derive

A 3D SLAM problem has 1000 poses (6 DOF each) and 500 landmarks (3 DOF each). The Gauss-Newton step solves HΔx = b where H is a square matrix. What is the dimension of H?

rows (= columns)

Show derivation

Poses: 1000 × 6 = 6000 variables

Landmarks: 500 × 3 = 1500 variables

Total = 7500 → H is 7500 × 7500

That's 56 million entries. But H is extremely sparse — each edge only connects two nodes, so each row has at most 6+6=12 nonzero entries. Sparse linear solvers like CHOLMOD exploit this sparsity to solve in milliseconds, not hours.

Chapter 6: 2-Link Planar Arm IK

A robotic arm with two links needs to reach a target. Forward kinematics (FK) tells you: given joint angles, where is the end effector? Inverse kinematics (IK) asks the reverse: given a target position, what joint angles do we need?

Forward kinematics (2-link arm):
x = L₁·cos(θ₁) + L₂·cos(θ₁ + θ₂)
y = L₁·sin(θ₁) + L₂·sin(θ₁ + θ₂)

Inverse kinematics (elbow-up solution):
cos(θ₂) = (x² + y² - L₁² - L₂²) / (2 L₁ L₂)
θ₂ = atan2(±√(1 - cos²θ₂), cosθ₂)
θ₁ = atan2(y, x) - atan2(L₂·sinθ₂, L₁ + L₂·cosθ₂)

Reachable workspace: |L₁ - L₂| ≤ √(x²+y²) ≤ L₁ + L₂

Two solutions, one target. For most reachable positions, there are exactly TWO IK solutions — "elbow up" and "elbow down." The ± in the θ₂ formula gives both. At the workspace boundary (fully extended or fully folded), the two solutions collapse to one.

Exercise 6.1: FK — Straight Arm Derive

L₁=L₂=1, θ₁=0, θ₂=0 (arm fully extended along x-axis). What is the x-coordinate of the end effector?

x (meters)

Show derivation

x = 1·cos(0) + 1·cos(0+0) = 1 + 1 = 2

y = 1·sin(0) + 1·sin(0+0) = 0 + 0 = 0

Both links point along the x-axis, so the end effector is at (2, 0). This is the maximum reach: L₁ + L₂ = 2.

Exercise 6.2: FK — Bent Arm Derive

L₁=L₂=1, θ₁=30°, θ₂=45°. What is the x-coordinate of the end effector? (Give 4 decimal places.)

cos(30°)=0.8660, sin(30°)=0.5, cos(75°)=0.2588, sin(75°)=0.9659.

x (meters)

Show derivation

x = cos(30°) + cos(30° + 45°) = cos(30°) + cos(75°)

x = 0.8660 + 0.2588 = 1.1248

y = sin(30°) + sin(75°) = 0.5 + 0.9659 = 1.4659

The first link points at 30°, and the second link at 30°+45°=75° (angles are cumulative). The end effector is at (1.125, 1.466), about 1.85m from the origin — less than the max reach of 2m because the arm is bent.

Exercise 6.3: IK — Solve θ₂ Derive

L₁=L₂=1, target at (1, 1). Compute cos(θ₂). What is θ₂ in degrees?

degrees (θ₂)

Show derivation

cos(θ₂) = (x² + y² - L₁² - L₂²) / (2 L₁ L₂)

= (1 + 1 - 1 - 1) / (2 × 1 × 1) = 0/2 = 0

θ₂ = acos(0) = 90°

The elbow bends at exactly 90° to reach (1, 1). This makes geometric sense: the target is at distance √2 from the origin, and two unit-length links forming a right angle span exactly √(1²+1²) = √2.

Exercise 6.4: Workspace Boundary Trace

For a 2-link arm with L₁=2, L₂=1: which target is unreachable?

(2, 0) — distance 2, within [1, 3] (1.5, 1.5) — distance 2.12, within [1, 3] (3, 1) — distance 3.16, exceeds L₁+L₂=3 (0, 1.5) — distance 1.5, within [1, 3]

Show explanation

(3, 1) is at distance √(9+1) = √10 ≈ 3.16, which exceeds the maximum reach L₁+L₂ = 3. Even fully extended, the arm can only reach 3 meters. The workspace is an annulus (donut) with inner radius |L₁-L₂|=1 and outer radius L₁+L₂=3.

Exercise 6.5: Implement forwardKinematics2Link() Build

Write a function that computes the end-effector position of a 2-link planar arm. Angles are in radians.

x = L1*cos(θ1) + L2*cos(θ1+θ2), y = L1*sin(θ1) + L2*sin(θ1+θ2).

Show solution

javascript
function forwardKinematics2Link(L1, L2, theta1, theta2) {
  const x = L1 * Math.cos(theta1) + L2 * Math.cos(theta1 + theta2);
  const y = L1 * Math.sin(theta1) + L2 * Math.sin(theta1 + theta2);
  return [x, y];
}

Chapter 7: Jacobian and Velocity

You can compute positions with FK and IK. But what about velocities? If a joint rotates at 1 rad/s, how fast does the end effector move? The Jacobian matrix maps joint velocities to end-effector velocities.

Velocity relationship:
[ẋ, ẏ]^T = J(θ) · [θ̇₁, θ̇₂]^T

Jacobian for 2-link arm:
J = [ ∂x/∂θ₁ ∂x/∂θ₂ ]
[ ∂y/∂θ₁ ∂y/∂θ₂ ]

J = [ -L₁s₁-L₂s₁₂ -L₂s₁₂ ]
[ L₁c₁+L₂c₁₂ L₂c₁₂ ]

where s₁=sinθ₁, c₁=cosθ₁, s₁₂=sin(θ₁+θ₂), c₁₂=cos(θ₁+θ₂)

Singularity: det(J) = 0 ⇒ arm loses a DOF
det(J) = L₁L₂sin(θ₂) = 0 when θ₂ = 0 or π

Singularity = arm stuck. When θ₂=0 (arm fully extended) or θ₂=π (fully folded), det(J)=0. The arm can't move in the radial direction — it can only swing sideways. At a singularity, the inverse Jacobian doesn't exist, so velocity-based IK (J^-1 · v_desired) fails. This is the kinematic equivalent of dividing by zero.

Exercise 7.1: Jacobian at θ=(0, 90°) Derive

L₁=L₂=1, θ₁=0, θ₂=90°. Compute J(1,1) — top-left entry of the Jacobian.

J(1,1) = -L₁sin(θ₁) - L₂sin(θ₁+θ₂). sin(0)=0, sin(90°)=1.

J(1,1)

Show derivation

J(1,1) = -L₁sin(0) - L₂sin(0+90°) = 0 - 1×1 = -1

J(1,2) = -L₂sin(90°) = -1

J(2,1) = L₁cos(0) + L₂cos(90°) = 1 + 0 = 1

J(2,2) = L₂cos(90°) = 0

J = [[-1, -1], [1, 0]]

Rotating joint 1 moves the end effector in the (-x, +y) direction. Rotating joint 2 moves it in the -x direction only. The determinant is (-1)(0) - (-1)(1) = 1 ≠ 0, so we're not at a singularity.

Exercise 7.2: End-Effector Velocity Derive

Using J = [[-1, -1], [1, 0]] from the previous exercise. Joint velocities: θ̇₁=1 rad/s, θ̇₂=0. What is the x-velocity of the end effector?

m/s

Show derivation

[ẋ, ẏ] = J · [θ̇₁, θ̇₂] = [[-1,-1],[1,0]] · [1, 0]

ẋ = -1×1 + (-1)×0 = -1 m/s

ẏ = 1×1 + 0×0 = 1 m/s

Joint 1 rotating at 1 rad/s swings the end effector at (-1, 1) m/s. The speed is √(1+1) = √2 ≈ 1.41 m/s. The end effector moves along a circle, and the Jacobian gives the instantaneous linear velocity of that circular motion.

Exercise 7.3: Singularity Detection Derive

Compute det(J) for L₁=L₂=1, θ₂=0 (fully extended).

det(J)

Show derivation

det(J) = L₁ · L₂ · sin(θ₂) = 1 × 1 × sin(0) = 0

Zero determinant means the Jacobian is singular — it cannot be inverted. Physically, the fully extended arm can only move perpendicular to itself (sideways). It cannot move radially (extend or contract) because both joints produce the same motion direction. This is the kinematic equivalent of two equations being linearly dependent.

Exercise 7.4: Manipulability Derive

The manipulability measure is w = |det(J)| = |L₁L₂sin(θ₂)|. For L₁=L₂=1, at what θ₂ (in degrees) is manipulability maximized?

degrees

Show derivation

w = |sin(θ₂)| is maximized when θ₂ = 90° or -90°

w_max = |sin(90°)| = 1

Maximum manipulability at θ₂=90° means the arm can move equally well in all directions. This is the "sweet spot" for control — the arm is most agile with a right-angle elbow bend. Robot path planners try to keep joints away from singularities (θ₂=0 or 180°) to maintain good manipulability.

Exercise 7.5: Find the Bug Debug

This Jacobian computation produces wrong velocities. Click the buggy line.

function jacobian2Link(L1, L2, t1, t2) {
  const s1 = Math.sin(t1), c1 = Math.cos(t1);
  const s12 = Math.sin(t1+t2), c12 = Math.cos(t1+t2);
  const J11 = -L1*s1 - L2*s12;
  const J12 = -L2*s12;
  const J21 = L1*c1 + L2*c12;
  const J22 = L2*s12;
  return [[J11,J12],[J21,J22]];
}

Show explanation

Line 7 is the bug. J(2,2) = ∂y/∂θ₂ = L₂·cos(θ₁+θ₂) = L2*c12, but the code uses L2*s12 (sine instead of cosine). This would give wrong end-effector velocities whenever joint 2 is moving. The correct line: const J22 = L2*c12;

Chapter 8: PID Control

You've solved kinematics — you know WHERE to go. Now: how do you actually GET there? A PID controller computes the motor command based on the error between desired and actual position. It's the workhorse of industrial control: simple, tunable, and good enough for 90% of robotic tasks.

Continuous PID:
u(t) = K_p·e(t) + K_i·∫e(τ)dτ + K_d·de/dt

Discrete PID:
u[k] = K_p·e[k] + K_i·(∑_j=0^k e[j]·dt) + K_d·(e[k] - e[k-1])/dt

Terms:
e(t) = target - actual // error
P term = K_p·e // proportional: big error → big correction
I term = K_i·∑e // integral: persistent error → accumulated correction
D term = K_d·Δe/dt // derivative: fast change → damping

Tuning intuition. Too much K_p: oscillates around the target (overcorrects). Too much K_i: overshoots then slowly drifts back (integral windup). Too much K_d: jittery and amplifies sensor noise. The art of PID tuning is balancing these three competing effects. Start with K_d=K_i=0, increase K_p until it oscillates, then add K_d to damp, then add a small K_i to eliminate steady-state error.

Exercise 8.1: P-Only Step 1 Derive

Target = 10, actual = 7, K_p = 2, K_i = 0, K_d = 0. What is the control output u?

Show derivation

e = target - actual = 10 - 7 = 3

u = K_p·e = 2 × 3 = 6

Pure proportional control: the output is simply the gain times the error. As the system approaches the target, the error shrinks and the correction weakens. This is why P-only control has steady-state error — the correction vanishes before reaching zero error.

Exercise 8.2: PID Over 3 Steps Derive

K_p=1, K_i=0.1, K_d=0.5, dt=1. Errors over 3 steps: e[0]=5, e[1]=3, e[2]=1. Assume e[-1]=0 (initial). What is u[2]?

u[k] = Kp·e[k] + Ki·(sum of all e×dt) + Kd·(e[k]-e[k-1])/dt.

u[2]

Show derivation

P term = K_p × e[2] = 1 × 1 = 1

I term = K_i × (e[0]+e[1]+e[2]) × dt = 0.1 × (5+3+1) × 1 = 0.9

D term = K_d × (e[2]-e[1])/dt = 0.5 × (1-3)/1 = -1

u[2] = 1 + 0.9 + (-1) = 0.9

The P term says "error is 1, push with force 1." The I term says "errors have been accumulating (total 9), push a little extra." The D term says "error is decreasing (from 3 to 1), back off to avoid overshoot." The net command 0.9 is a gentle push — the system is converging nicely.

Exercise 8.3: Diagnose the Behavior Trace

A robot arm oscillates rapidly around the target position without settling. Which PID parameter is likely too high?

K_p — too much proportional gain causes oscillation (overcorrection) K_i — integral windup K_d — noise amplification All three are equally likely

Show explanation

Rapid oscillation around the target is the hallmark of excessive K_p. The controller overcorrects: it pushes too hard, overshoots, sees a negative error, pushes back too hard, overshoots again. K_i issues cause slow drift/overshoot (not rapid oscillation). K_d issues cause jitter from noise, not sustained oscillation. Fix: reduce K_p and/or add K_d for damping.

Exercise 8.4: Implement pidStep() Build

Implement a single PID step. Takes current error, previous error, accumulated integral, gains, and dt. Returns [u, newIntegral].

newIntegral = integral + e*dt. u = Kp*e + Ki*newIntegral + Kd*(e-ePrev)/dt.

Show solution

javascript
function pidStep(e, ePrev, integral, Kp, Ki, Kd, dt) {
  const newIntegral = integral + e * dt;
  const P = Kp * e;
  const I = Ki * newIntegral;
  const D = Kd * (e - ePrev) / dt;
  return [P + I + D, newIntegral];
}

Exercise 8.5: Steady-State Error Trace

A P-only controller (K_i=K_d=0) controlling a motor against gravity settles at a constant error. Which term do you add to eliminate this?

Increase K_p more — brute force Add K_i — the integral term accumulates until the error reaches zero Add K_d — derivative will push harder Switch to a different controller entirely

Show explanation

Steady-state error occurs when the proportional correction exactly balances the disturbance (gravity). The system is in equilibrium: K_p×e = gravity force. The error e is nonzero because that's what generates the force to hold position. Adding K_i breaks this equilibrium: the integral accumulates over time, continuously increasing the control effort until e=0. Only then does the integral stop growing. This is why the I term exists — it's the only PID component that guarantees zero steady-state error.

Exercise 8.6: D-Term Sign Derive

Error decreasing: e[k]=2, e[k-1]=5, dt=0.1. K_d=1. What is the D term?

D term

Show derivation

D = K_d × (e[k] - e[k-1]) / dt = 1 × (2 - 5) / 0.1 = -30

The D term is negative because the error is decreasing rapidly. It acts as a brake — opposing the direction of movement to prevent overshoot. The magnitude is large (-30) because the error changed by 3 in just 0.1 seconds, so the derivative (rate of change) is -30/s. This is why K_d must be tuned carefully: too much and the system sluggishly approaches the target.

Chapter 9: Capstone — Robot Navigation

A mobile robot with wheel odometry and a LiDAR that detects known landmarks. Everything from Chapters 0-8 comes together: transforms to convert between frames, the EKF to fuse odometry with landmark measurements, and PID to execute the path. Let's work through a complete navigation cycle.

The full pipeline. Real robot navigation is: (1) predict pose from odometry, (2) detect landmarks in sensor frame, (3) transform to world frame, (4) compute innovation, (5) update EKF, (6) compute control. This capstone tests each piece with concrete numbers.

Exercise 9.1: EKF State Design Design

Design the EKF pipeline for a robot with wheel odometry and LiDAR landmark detection. Arrange the steps in the correct order.

→

Predict (motion model) Detect landmark Compute innovation EKF update (K, x+, P+) PID control command

Show explanation

The EKF loop: (1) Predict the new pose using the motion model, (2) Detect a landmark via LiDAR, (3) Innovate — compare predicted vs actual measurement, (4) Update the state estimate using the Kalman gain, (5) Control — compute the next motor command based on the updated pose estimate.

Exercise 9.2: Full Predict Step Derive

Robot at x=[5, 3, π/6] (30° heading). Drives v=1 m/s, ω=0.1 rad/s for dt=1s. What is the new x-coordinate?

x' = x + v·cos(θ)·dt. cos(30°) = √3/2 ≈ 0.8660.

x (meters)

Show derivation

x' = 5 + 1 × cos(π/6) × 1 = 5 + 0.8660 = 5.8660

y' = 3 + 1 × sin(π/6) × 1 = 3 + 0.5 = 3.5

θ' = π/6 + 0.1 × 1 = 0.5236 + 0.1 = 0.6236 rad = 35.73°

New state: [5.866, 3.5, 0.624]. The robot moved mostly in x (because cos30° > sin30°) and turned slightly leftward (heading increased by 0.1 rad ≈ 5.7°).

Exercise 9.3: Landmark Observation Derive

From the predicted pose [5.866, 3.5, 0.624], the robot observes a landmark at world position (10, 7). What is the predicted range?

meters

Show derivation

d_x = 10 - 5.866 = 4.134

d_y = 7 - 3.5 = 3.5

r = √(4.134² + 3.5²) = √(17.09 + 12.25) = √29.34 ≈ 5.42

The landmark is about 5.4 meters away. If the LiDAR measures 5.3m instead, the innovation is -0.12m — suggesting the robot is slightly closer to the landmark than predicted. The EKF update will pull the pose estimate toward the landmark.

Exercise 9.4: Update with Measurement Derive

Simplified 1D: predicted x=5.866, P=0.5. LiDAR range suggests true x should be 5.7 (after inverting the measurement model). R=0.2. What is the updated x?

This is just the scalar Kalman update: K = P/(P+R), x+ = x + K(z-x).

x (meters)

Show derivation

K = P/(P+R) = 0.5/(0.5+0.2) = 0.5/0.7 = 0.7143

x⁺ = 5.866 + 0.7143 × (5.7 - 5.866) = 5.866 + 0.7143 × (-0.166) = 5.866 - 0.1186 = 5.747

K = 0.71, meaning we trust the measurement about 71% (because R=0.2 is smaller than P=0.5). The updated position shifts 71% of the way from the prediction toward the measurement. The updated P⁺ = (1-0.71) × 0.5 = 0.143, much smaller than both P and R — fusion always improves certainty.

Exercise 9.5: Updated Covariance Derive

From the previous exercise: P=0.5, K=0.7143. What is the updated covariance P⁺ = (1-K)P?

P⁺

Show derivation

P⁺ = (1 - 0.7143) × 0.5 = 0.2857 × 0.5 = 0.1429

The covariance dropped from 0.5 to 0.143 — the uncertainty decreased by 71%. This is the fundamental guarantee of the Kalman filter: fusing two uncertain estimates always produces a better estimate than either one alone. The updated covariance is always smaller than both the prediction covariance P and the measurement noise R.

Exercise 9.6: Navigation Controller Derive

Updated pose: x=5.747, target waypoint: x=10. K_p=0.5 for the velocity controller (u = K_p × position_error). What velocity command does the PID produce?

m/s

Show derivation

e = 10 - 5.747 = 4.253

u = K_p × e = 0.5 × 4.253 = 2.127 m/s

The P controller commands 2.1 m/s. In practice, you'd clamp this to the robot's maximum speed. As the robot approaches the waypoint, the error shrinks and the velocity naturally decreases — this is the beauty of proportional control for waypoint following. A real system would add K_d to avoid overshoot at the waypoint.

Exercise 9.7: Full Jacobian F Derive

For the capstone prediction (θ=π/6, v=1, dt=1), compute the (2,3) entry of the Jacobian F. F(2,3) = v·cos(θ)·dt.

F(2,3)

Show derivation

F(2,3) = v · cos(θ) · dt = 1 × cos(π/6) × 1 = √3/2 ≈ 0.866

Full Jacobian F = [[1, 0, -0.5], [0, 1, 0.866], [0, 0, 1]]

F(2,3)=0.866 says: a small change in heading increases the y-velocity component. At θ=30°, the robot is mostly moving in x; a small heading increase rotates velocity toward y. F(1,3)=-0.5 = -sin(30°) says: that same heading change reduces x-velocity, because rotating away from the x-axis.

The proof of work. If you completed every exercise in this workbook — rotated points, chained transforms, computed quaternions, derived Jacobians for the EKF and kinematic chains, traced SLAM graph optimization, tuned PID controllers, and assembled a full navigation pipeline — you understand the math that makes robots move. These are the exact calculations running inside every mobile robot, drone, and robot arm. "What I cannot create, I do not understand."

Related Lessons

Topic	Lesson
Kalman filtering	Kalman Filter — From Absolute Zero
Extended Kalman filter	EKF — From Absolute Zero
Classical SLAM	Classical SLAM — From Absolute Zero
Classical VIO	Classical VIO — From Absolute Zero
MDPs & planning	MDP — From Absolute Zero