My solution to the homework

Lab0

Localization: Determine the pose of a robot relative to a given map of the environment.
Mapping: Construct a map.
Planning: Plan a trajectory to goal based on localization and maps.

Bayes Filter

The Bayes filter algorithm includes two steps, the first step is prediction (push belief through dynamics given action), the second step is correction (apply Bayes rule given measurement).

$$ \begin{gather*} \overline{bel}\left(x_{t}\right) = \int p\left(x_{t} | u_{t}, x_{t-1}\right) bel\left(x_{t-1}\right) d x_{t-1} \\ bel\left(x_{t}\right) = \eta p\left(z_{t} | x_{t}\right) \overline{bel}\left(x_{t}\right) \end{gather*} $$

Kalman Filter

Assumptions:

Linear dynamics $p(x_t | u_t, x_{t-1}) = A_{t} x_{t-1}+B_{t} u_{t}+\varepsilon_{t}$, where $\varepsilon_t \sim \mathcal{N}(0, R_t)$
Linear measurement model $p(z_t | x_t) = C_{t} x_{t}+\delta_{t}$, where $\delta_t \sim \mathcal{N}(0, Q_t)$
Initial belief $bel(x_0)$ is a Gaussian distribution $$ bel\left(x_{0}\right)=p\left(x_{0}\right)=\det\left(2 \pi \Sigma_{0}\right)^{-\frac{1}{2}} \exp \left\{-\frac{1}{2}\left(x_{0}-\mu_{0}\right)^{T} \Sigma_{0}^{-1}\left(x_{0}-\mu_{0}\right)\right\} $$

Algorithm:

$$ \begin{align*} &\textbf{Algorithm KalmanFilter} (\mu_{t-1}, \Sigma_{t-1}, u_t, z_t): \\ &\bar{\mu}_{t} =A_{t} \mu_{t-1}+B_{t} u_{t} \\ &\bar{\Sigma}_{t} =A_{t} \Sigma_{t-1} A_{t}^{T}+R_{t} \\ &K_{t} =\bar{\Sigma}_{t} C_{t-1}^{T}\left(C_{t} \bar{\Sigma}_{t} C_{t}^{T}+Q_{t}\right)^{-1} \\ &\mu_{t} =\bar{\mu}_{t}+K_{t}\left(z_{t}-C_{t} \bar{\mu}_{t}\right) \\ &\Sigma_{t} =\left(I-K_{t} C_{t}\right) \bar{\Sigma}_{t} \\ &\text{return } \mu_{t}, \Sigma_{t} \end{align*} $$

Lab1

Motion Model

Kinematic model: map wheel speeds to robot velocities.
Dynamic model: map wheel torques to robot accelerations.

Consider only kinematic model $p(x_t | u_t, x_{t-1})$ for now (assume we can set the speed directly), then $x = [x, y, \theta]^T$, $u = [V, \delta]^T$, where $\theta$ is the heading and $\delta$ is the steering angle.

Now we will derive the motion model for rear axel. Note that a rigid body undergoing rotation and translation can be viewed as pure rotation about a instant center of rotation:

$$ \begin{align*} &\dot{x}=V \cos (\theta)\\ &\dot{y}=V \sin (\theta)\\ &\dot{\theta}=\omega=\frac{V \tan \delta}{L} \end{align*} $$

With numerical integration we can get

$$ \begin{align*} x_{t+1} &=x_{t}+\frac{L}{\tan (\delta)}\left(\sin \left(\theta_{t+1}\right)-\sin \left(\theta_{t}\right)\right) \\ y_{t+1} &=y_{t}+\frac{L}{\tan (\delta)}\left(-\cos \left(\theta_{t+1}\right)+\cos \left(\theta_{t}\right)\right) \\ \theta_{t+1} &=\theta_{t}+\frac{v}{L} \tan (\delta) \Delta t \quad \end{align*} $$

Noise

Control signal error: $\hat{V} \sim \mathcal{N}(V, \sigma_v^2)$, $\hat{\delta} \sim \mathcal{N}(\delta, \sigma_\delta^2)$
Incorrect physics (the Kinematic Model is inaccurate): $\hat{x} \sim \mathcal{N}\left(x, \sigma_{x}^{2}\right)$, $\hat{y} \sim \mathcal{N}\left(y, \sigma_{y}^{2}\right)$, $\hat{\theta} \sim \mathcal{N}\left(\theta, \sigma_{\theta}^{2}\right)$

Sensor Model

$p(z_t | x_t, m)$ is the probability of sensor reading $z_t$ given state $x_t$ and map $m$. Calculate simulated sensor reading $z^*$ form $x$ and $m$ and then compare with $z$.

Assume individual beams are conditionally independent given map (may result in overconfidence problem):

$$ p\left(z_{t} | x_{t}, m\right)=\prod_{i=1}^{K} p\left(z_{t}^{k} | x_{t}, m\right) $$

Noise

Simple measurement noise in distance value
$$ \require{mathtools} p_{\text {hit}}\left(z_{t}^{k} | x_{t}, m\right)=\begin{dcases} \eta \mathcal{N}\left(z_{t}^{k} ; z_{t}^{k *}, \sigma_{\text {hit}}^{2}\right) & \text {if } 0 \le z_{t}^{k} \le z_{\max} \\ 0 & \text { otherwise } \end{dcases} $$
Presence of unexpected objects
$$ p_{\text {short}}\left(z_{t}^{k} | x_{t}, m\right)=\begin{dcases} \eta \lambda_{\text {short}} e^{-\lambda_{\text {short}} z_{t}^{k}} & \text {if } 0 \le z_{t}^{k} \le z_{t}^{k *} \\ 0 & \text {otherwise } \end{dcases} $$
Laser returns max range when no objects
$$ p_{\max}\left(z_{t}^{k} | x_{t}, m\right)=I\left(z=z_{\max}\right)=\begin{cases} 1 & \text {if } z=z_{\max } \\ 0 & \text {otherwise } \end{cases} $$
Failures in sensing
$$ p_{\text {rand}}\left(z_{t}^{k} | x_{t}, m\right)=\begin{dcases} \frac{1}{z_{\max}} & \text {if } 0 \le z_{t}^{k}

Combine these 4 model to get

$$ p\left(z_{t}^{k} | x_{t}, m\right)=\begin{pmatrix} z_{\mathrm{hit}} \\ z_{\text {short}} \\ z_{\text {max}} \\ z_{\text {rand}} \end{pmatrix}^{T} \begin{pmatrix} p_{\text {hit}}(z_{t}^{k} | x_{t}, m) \\ p_{\text {short}}(z_{t}^{k} | x_{t}, m) \\ p_{\text {max}}(z_{t}^{k} | x_{t}, m) \\ p_{\text {rand}}(z_{t}^{k} | x_{t}, m) \end{pmatrix} $$

Particle Filter

The particle filter is an alternative nonparametric implementation of the Bayes filter. The key idea is to represent the posterior $bel(x_t)$ by a set of random state samples (particles) drawn from this posterior.

$$ \begin{align*} &\textbf{Algorithm ParticleFilter} (\mathcal{X}_{t-1}, u_t, z_t ): \\ &\bar{\mathcal{X}}_{t}=\mathcal{X}_{t}=\varnothing \\ &\text{for } m=1 \text{ to } M \text{ do} \\ &\qquad \text{sample } x_{t}^{[m]} \sim p\left(x_{t} | u_{t}, x_{t-1}^{[m]}\right) \\ &\qquad {w_{t}^{[m]}=p\left(z_{t} | x_{t}^{[m]}\right)} \\ &\qquad {\bar{\mathcal{X}}_{t}=\bar{\mathcal{X}}_{t}+\left\langle x_{t}^{[m]}, w_{t}^{[m]}\right\rangle} \\ &\text{endfor } \\ &\text{for } m = 1 \text{ to } M \text{ do} \\ &\qquad \text{draw } i \text{ with probability } \propto w_{t}^{[i]} \\ &\qquad \text{add } x_t^{[i]} \text{ to } \mathcal{X}_t \\ &\text{endfor} \\ &\text{return } \mathcal{X}_t \end{align*} $$

We first sample $x_t$ from the state transition distribution, then calculate the importance factor, $w_t$. In the next loop, we do the resampling/importance sampling, to change the distribution of particles from $\overline{bel}(x_t)$ to $bel(x_t)$.

Resample

Resampling can cause high-variance (low-entropy) problem, where particles are depleted. Possible fixes:

If the variance of weights low, don’t resample.
Use low-variance sampling.

$$ \begin{align*} &\textbf{Algorithm LowVarianceSampler} (\mathcal{X}_{t}, \mathcal{W}_{t}): \\ &\bar{X}_{t}=\varnothing \\ &r=\operatorname{rand}\left(0 ; M^{-1}\right) \\ &c=w_{t}^{[1]} \\ &i=1 \\ &\text{for } m=1 \text{ to } M \text{ do} \\ &\qquad U=r+(m-1) \cdot M^{-1} \\ &\qquad \text{while } U > c \\ &\qquad\quad i=i+1 \\ &\qquad\quad c=c+w_{t}^{[i]} \\ &\qquad \text{endwhile} \\ &\qquad \text{add } x_{t}^{[i]} \text{ to } \bar{X}_{t} \\ &\text{endfor } \\ &\text{return } \bar{\mathcal{X}}_t \end{align*} $$

Expected Pose

Calculate expected pose from particles: $x$ and $y$ can be computed directly by the weighted average. However, the weighted average of $\theta$ is not accurate. Thus we use cosine and sine averaging. (Ref)

Code

MotionModel.py:

Subscribe to motor topic, get control info (speed and steering angle), also save last frame info for calculation.
Add variance to speed and steering angle. Apply motion model on particles. Add variance to states.

SensorModle.py:

Precompute the sensor model table and use range_libc package to achieve 2D raycasting for 2D occupancy grid.
Subscribe to scan topic, filter out invalid and extreme scan values, downsample, and pass processed data to update weights.

ParticleFilter.py:

Transfer map to occupancy grid.
Globally initialize particles and weights. Initialize motion model and sensor model.
Subscribe to /intialpose topic, initialize particles and weights around initial pos.
When receiving scan infos, update weights, resample and calculate the expected pose for visualization.

Lab2

	Uses Model	Stability Guarantee	Minimize Cost
PID	No	No	No
Pure Pursuit	Circular arcs	Yes - with assumptions	No
Lyapunov	Non-linear	Yes	No
LQR	Linear	Yes	Quadratic
iLQR	Non-linear	Yes	Yes

Control

Position error in frame A:

$$ {}^{A}e = \begin{bmatrix} x \\ y \end{bmatrix} - \begin{bmatrix} x_{ref} \\ y_{ref} \end{bmatrix} $$

We want position error in frame B so that $x$ and $y$ error correspond to along-track and cross-track error respectively:

$$ \begin{split} {}^{B}e &= {}_A^{B} R {}^{A}e \\ &=R(-\theta_{ref}) \left( \begin{bmatrix} x \\ y \end{bmatrix} - \begin{bmatrix} x_{ref} \\ y_{ref} \end{bmatrix} \right) \\ &= \begin{bmatrix} \cos \left(\theta_{r e f}\right)\left(x-x_{r e f}\right)+\sin \left(\theta_{r e f}\right)\left(y-y_{r e f}\right) \\ -\sin \left(\theta_{r e f}\right)\left(x-x_{r e f}\right)+\cos \left(\theta_{r e f}\right)\left(y-y_{r e f}\right) \end{bmatrix} \\ &= \begin{bmatrix} e_{at} & e_{ct} \end{bmatrix}^T \end{split} $$

Only consider cross-track error $e_{ct}$ and control steering angle for now.

PID

$$ u=-\left(K_{p} e_{c t}+K_{i} \int e_{c t}(t) d t+K_{d} \dot{e}_{c t}\right) $$

where

$K_p$: Proportional coefficient
$K_i$: Integral coefficient
$K_d$: Derivative coefficient

Analytically compute the derivative term:

$$ \begin{split} \dot{e}_{c t} &=-\sin \left(\theta_{ref}\right) \dot{x}+\cos \left(\theta_{ref}\right) \dot{y} \\ &=-\sin \left(\theta_{ref}\right) V \cos (\theta)+\cos \left(\theta_{ref}\right) V \sin (\theta) \\ &=V \sin \left(\theta-\theta_{ref}\right) \\ &=V \sin \left(\theta_{e}\right) \end{split} $$

Pure-Pursuit

Key idea: The car is always moving in a circular arc.

Find a lookahead and compute arc
Move along the arc
Go to step 1

Solve for arc:

$$ \begin{gather*} \alpha =\tan ^{-1}\left(\frac{y_{ref}-y}{x_{ref}-x}\right)-\theta \\ R = \frac{L}{2 \sin\alpha} \\ \dot{\theta} = \frac{V}{R} = \frac{2V\sin\alpha}{L} \\ \dot{\theta} = \frac{V\tan u}{B} \end{gather*} $$

where $B$ is the car length. Solve for $u$ we can get

$$ u = \tan^{-1}\left( \frac{2B \sin \alpha}{L} \right) $$

Lyapunov Control

Define Lyapunov function:

$$ V\left(e_{ct}, \theta_{e}\right)=\frac{1}{2} k_{1} e_{ct}^{2}+\frac{1}{2} \theta_{e}^{2} $$

Compute the derivative:

$$ \begin{split} \dot{V}\left(e_{c t}, \theta_{e}\right)&=k_{1} e_{c t} \dot{e}_{c t}+\theta_{e} \dot{\theta}_{e}\\ &=k_{1} e_{c t} V \sin \theta_{e}+\theta_{e} \frac{V}{B} \tan u \end{split} $$

Set $u$ to get $\dot{V} < 0$:

$$ \begin{gather*} \theta_{e} \frac{V}{B} \tan u=-k_{1} e_{c t} V \sin \theta_{e}-k_{2} \theta_{e}^{2} \\ \tan u=-\frac{k_{1} e_{c t} B}{\theta_{e}} \sin \theta_{e}-\frac{B}{V} k_{2} \theta_{e} \\ u=\tan ^{-1}\left(-\frac{k_{1} e_{c t} B}{\theta_{e}} \sin \theta_{e}-\frac{B}{V} k_{2} \theta_{e}\right) \end{gather*} $$

LQR

Turn the problem into an optimization to trade-off both driving error and keeping control action small:

$$ \min _{u(t)} \int_{0}^{\infty}\left(w_{1} e(t)^{2}+w_{2} u(t)^{2} \right) dt $$

Given

Linear dynamic system $$ x_{t+1} = Ax_t + Bu_t $$
Quadratic cost $$ J=\sum_{t=0}^{T-1} x_{t}^{T} Q x_{t}+u_{t}^{T} R u_{t} $$

The optimal control sequence minimizing the cost is

$$ \begin{gather*} u_t = K_t x_t \\ K_{t}=-\left(R+B^{T} V_{t+1} B\right)^{-1} B^{T} V_{t+1} A \\ V_{t}=Q+K_{t}^{T} R K_{t}+\left(A+B K_{t}\right)^{T} V_{t+1}\left(A+B K_{t}\right) \end{gather*} $$

MPC

Plan a sequence of control actions
Predict the set of next states to a horizon H
Evaluate the cost/constraint of the states and controls
Optimize the cost

Code

controlnode.py: main script

Define Subscribers to pose and path infos; Publishers for visualization and Services for reset
Enter main program when initial pose is set and controller is ready. Get pose and reference pose, get next control and publish to /vesc/high_level/ackermann_cmd_mux/input/nav_0 topic.
Stop when path is completed

runner_script.py:

Loading different paths and speed
Start the controller

controller.py: base controller class

Store path info
Define utility functions: get reference pose by index, get pose and pose_ref error

Find reference pose function is the same for all controllers: Find the nearest point on the path and lookahead some distance.

pid.py:

Use PD equation to get next control (steering angle)

purepursuit.py:

Use Purepursuit equation to get next control

mpc.py:

Dividing steering angle $[-\pi, \pi]$ equally to $K$ rollouts
Execute each steering angel through $T$ timesteps to collect $K * T$ poses
Evaluate the cost of each rollout by collision cost and error cost
1. Collision cost: if any pose in a trajectory is in collision with the map, add a large cost
2. Error cost: norm of the distance between last pose and the reference pose, weighted by a constant
Choose the rollout with the minimal cost and execute the first step
Return to step 1

mpc2.py: Similar to mpc but use scan info rather than map. The only thing change is the obstacles cost.

Calculate $N$ obstacles pose in the map by scan info
For $K*T$ poses, calculate the distance with every obstacles to get $K*T*N$ array
Find the minimal distance for every pose to get $K * T$ array
Average through timesteps to get $K$ array, then weighted by a constant to get the obstacles cost.

nonlinear.py:

Use Lyapunov control equation to get the control

Lab3

Steps for planing the path from one point to another:

Random sample points on the map and construct the graph
Use planning algorithm (A*) to search the graph for optimal path

Graph Construction

The graph we are constructing is called Random geometric graph (RGG)

Sample a set of collision free vertices $V$
Connect neighboring vertices to get edges $E$

Sampling

Uniform random sampling tends to clump. We want points to be spread out evenly, which can be achieved by Halton sequence

Optimal Radius

We want to choose the radius smaller enough for efficiency while ensuring connectivity. The optimal value can be chosen as

$$ r = \left(\frac{\ln|V|}{\alpha_{p,d} |V|}\right)^{1/n} $$

where $\alpha_{p,d}$ is a constant. For special case of a two-dimensional space and the euclidean norm ($d=2$, $p=2$), $\alpha_{p,d} = \pi$

Dubins Path

Since we are considering 2-D car dynamics, we need to connect two points with feasible path instead of straight lines. Mathematically, we need to solve the BVP:

$$ \dot{q}(t) = f(q(t), u(t)) \\ q(0) = q_1,\quad q(t) = q_2 $$

where $q=(x,y,\theta)$ in our case.

Dubins path shows that the solution always exists and has to be one of 6 classes:

$$ \{LRL, RLR, LSL, LSR, RSL, RSR\} $$

where $L,R,S$ represent turn left, right, straight, respectively.

Planning Algorithm

Given start node $s_0$, goal $s_1$ and cost $c(s, s')$, create objects:

OPEN: priority queue of nodes to be processed
CLOSED: list of nodes already processed
$g(s)$: estimate of the least cost from start to a given node

The pseudocode for best first search can be expressed as

Push $s_0$ into OPEN
While $s_1$ not expanded 3. Pop best from OPEN 4. Add best to CLOSED 5. For every successor s’ 6. If $g(s') > g(s) + c(s, s')$ 7. $g(s') = g(s) + c(s, s')$ 8. Add (update) $s'$ to OPEN

The main problem is how to choose the heuristic function $f(s)$ for step 3.

Dijkstra’s Algorithm

Choose $f(s) = g(s)$. Always pop the node with the smallest cost from the origin first.

A*

If we can pre-evaluate the cost from the node to the goal $h(s)$, then we can choose a better $f(s) = g(s) + h(s)$.

If $h(s)$ is admissible $h(s) \le h^*(s)$, $h(\text{goal}) = 0$, then the path return by A* is optimal.
If $h(s)$ is consistent $h(s) \le c(s, s') + h(s')$, $h(\text{goal}) = 0$, then A* is optimal and efficient (will not re-expand a node).

All consistent heuristics are admissible, not vice versa.

Weighted A*

Choose $f(s) = g(s) + \epsilon h(s)$, where $\epsilon > 1$. It is more efficient and the solution is $\epsilon$-optimal $c \le \epsilon c^*$

Lazy A*

Instead of checking edge collision for all neighbors, only check the edge to parent when expanding. OPEN list will have multiple copies of a node since we haven’t collision check.

Shortcut

After a path is found, we can randomly pick two nodes and connect them directly if the edge is collision-free.

Code

run.py: main program

Load map info
Construct graph given environment, sampler, number of vertices, connection radius.
Add start and end node
Use A* or lazy A* algorithm to search the optimal path and visualize

MapEnvironment.py: define utility functions associated with planning

Check edge collision
Compute heuristic and distance function
Generate path on the map
Visualization of the graph and path

Sampler.py: create random samples for graph construction

Use halton sequence to generate 2-D random vertices in $(0,1)$
Scale by map info

graph_maker.py: construct graph

Use python package NetworkX to easily construct a graph
Add sampled valid vertices
Connect edges within radius and without collision (do not check collision if using lazy A*)

astar.py: A* algorithm

Use heapq package to create priority queue to store [f(s), count, node, g(s), parent] info. count is used to prevent comparing node when $f(s)$ are the same.
Use dict enqueued to store $g(s)$ and $h(s)$ for a node, and another dict explored to store the explored node and its parent node.
Add start node to the queue.
While queue is not empty, pop one node and add it to explored if it is not already there.
For all the neighbor node $s'$, compute $g(s') = g(s) + c(s,s')$ if $s'$ is not already in explored.
If $s'$ is in enqueued, get its previous $g'(s')$ and $h'(s')$. Continue to next neighbor if $g'(s) \le g(s')$, since it is better. If not in enqueued, compute $h(s')$ by the heuristic function.
Update $g(s')$ and $h(s')$ in enqueued and push it to the queue.
If reach target node, continually find parent node by explored and return the path.

lazy_astar.py: lazy A* algorithm

When expanding node, check its edge collision with parent.
When checking neighbors, do not need to compare $g'(s')$ and $g(s')$
Multiple nodes with different parents can be added to the queue.

runDubins.py: similar to run.py but use Dubins environment

DubinsMapEnvironment.py: inherited from MapEnvironment

Compute distances with Dubins path
Compute heuristic with Dubins path
Generate path by Dubins path planning

DubinsSampler.py:

Sample $N$ vertices same as in Sampler.py
Divide angles to $M$ parts, add angles to each vertices to create $M\times N$ samples

Dubins.py: utility function for Dubins path

ROSPlanner.py: take goal from rviz and do planning. Be careful of the frame change between map and world

Load map info
Construct graph and save for later use
Subscribe to pose topic, save current pose
Subscribe to goal topic, plan from the current pose to the goal, publish the path using service defined in lab2
If there are multiple goals, sequentially planning through them and combine the path

Mobile Robots

Lab0

Bayes Filter

Kalman Filter

Lab1

Motion Model

Noise

Sensor Model

Noise

Particle Filter

Resample

Expected Pose

Code

Lab2

Control

PID

Pure-Pursuit

Lyapunov Control

LQR

MPC

Code

Lab3

Graph Construction

Sampling

Optimal Radius

Dubins Path

Planning Algorithm

Dijkstra’s Algorithm

A*

Weighted A*

Lazy A*

Shortcut

Code

Reference

Lab0#

Bayes Filter#

Kalman Filter#

Lab1#

Motion Model#

Noise#

Sensor Model#

Noise#

Particle Filter#

Resample#

Expected Pose#

Code#

Lab2#

Control#

PID#

Pure-Pursuit#

Lyapunov Control#

LQR#

MPC#

Code#

Lab3#

Graph Construction#

Sampling#

Optimal Radius#

Dubins Path#

Planning Algorithm#

Dijkstra’s Algorithm#

A*#

Weighted A*#

Lazy A*#

Shortcut#

Code#

Reference#

Lab0

Bayes Filter

Kalman Filter

Lab1

Motion Model

Noise

Sensor Model

Noise

Particle Filter

Resample

Expected Pose

Code

Lab2

Control

PID

Pure-Pursuit

Lyapunov Control

LQR

MPC

Code

Lab3

Graph Construction

Sampling

Optimal Radius

Dubins Path

Planning Algorithm

Dijkstra’s Algorithm

A*

Weighted A*

Lazy A*

Shortcut

Code

Reference