Given a weighted directed graph, how to find the shortest path between two nodes?
- If the graph is not weighted, we can assume each edge has an equal weight of one.
- If the graph is not directed, we can think the edge is bi-directional.
- In case of multiple edges, we can just keep the edge with smallest weight.
- Self edges will be consider as zero weight.
Bellman-Ford Algorithm
- Starting from some basic observation: Each edge already gives us the shortest path between its two nodes. For not directly connected nodes, their shortest paths contains more than one edge.
- We can define a sub questions: By using at most k edges, what is the shortest path between any two nodes. When k=1, the shortest path is the edge weight if they are connected, otherwise, it is infinite.
- Optimal sub-structure: dp[k][x][y] = min(dp[k-1][x][z]+dp[1][z][y]), for z in all nodes
- dp[1][z][y] is only non infinite if there is edge between z and y. Therefore, the above structure is equivalent to dp[k][x][y] = min(dp[k-1][x][z]+weight[z][y]), for z in all y’s ancestors.
- Traversing all x, y and y’s ancestors is equivalent to traverse all x (a src node) and all edges, or traverse all edges, then all src nodes.
- Original question: ans[x][y] = min(dp[k][x][y]), for all possible k.
- By the optimal sub-structure, we know dp[k][x][y] keeps decreasing, when will it stop or will it stop?
- Case1: dp[k][x][y] doesn’t change for all x, y nodes when compared to dp[k-1][x][y]. Now, we know the dp values will not change any more. We found the final answer.
- Case2: dp[k][x][y] keeps decreasing when compare to dp[k-1][x][y], and it will not stop. This means dp values will go to negative. The shortest path from x to y keeps changing, more and more edges are added to the path.
- What happens if we have more and more edges in the path? It means at some point, we will have a loop in the path. Proof: a path can be denote as (node0 -> node1 -> … -> node_i). When there are N edges, there will be N+1 nodes showing up in the path. If we only have N nodes in the graph, it is guaranteed that some node appears at least twice in the path. It means we have a loop in the path.
- Similarly, if we keep having more edges, it means we are having more loops. In case2, combined the fact that we have unbound decreasing shortest path, it means we keep adding negative loops into the shortest path.
// Initialization
for (int src=0;src<N;++src) {
dp[src][src] = 0;
}
// At most N-1 edges in the path.
for (int k=1;k<=N-1;++k) {
for (int src=0;src<N;++src) {
for (auto [a, b, w]:Edges) {
// Be aware of the infinite case.
if (dp[src][a]==INT_MAX) continue;
// DP vector compression.
dp[src][b] = std::min(dp[src][b], dp[src][a] + w);
}
}
}
// Check if there is negative loop in the at most N edges case.
for (auto [a, b, w]:Edges) {
if (dp[b][b]>dp[b][a]+w) {
is_negative_loop=true;
}
}
// Bellman-Ford for single src case
dp[src] = 0;
for (int k=1;k<=N-1;++k) {
for (auto [a, b, w]:Edges) {
if (dp[a]==INT_MAX) continue;
dp[b] = std::min(dp[b], dp[a]+w);
}
}
// Detect negative loop. If dp[i] can be update to
// a smaller value, it means from src, we can go to
// a negative loop and then arrive node i. But if
// none of dp values can be update, it only means
// we cannot arrive a negative loop from src. It
// doesn't mean there isn't negative loop.
for (auto [a, b, w]: Edges) {
if (dp[b]>dp[a]+w) {
is_negative_loop=true;
}
}
SPFA (Shortest Path Faster Algorithm)
- Consider the DP process of Bellman-Ford Algorithm. Going from at-most k-1 edges case to at-most k edges case, if the shortest path between x and y nodes doesn’t change, then in the next at-most k+1 case, min(DP[k][x][y]+DP[1][y][z]) will not change for all z nodes.
- In other word, only if the shortest path between two nodes changes, we will be able to use it to construct other new shortest paths later. If the shortest path doesn’t change, we don’t need to consider using this path to update other path anymore.
- Worse case time complexity of SPFA is the same as Bellman-Ford algorithm. There are other ways to optimize Bellman-Ford algorithm (all kinds of priority queue with different strategies). But these optimization will might break the time complexity property of Bellman-Ford algorithm.
// SPFA for single score case
dp[src] = 0;
std::queue<int> q;
q.push(src);
// We don't need to keep track of the current
// maximum number of edges in shortest paths,
// if we know there is no negative loop
int max_edges_in_path = 0;
while (!q.empty() && max_edges_in_path<n) {
max_edges_in_path += 1;
int sz=q.size();
for (int i=0;i<sz;++i) {
int cur = q.front(); q.pop();
for (auto next:graph[cur]) {
if (dp[cur]+Weight[cur][next] < dp[next]) {
q.push(next);
}
}
}
}
if (max_edges_in_path==n) {
is_negative_loop = true;
}
Floyd-Warshall Algorithm
- This is another way to think about the sub optimal structure.
- We start from some basic observation again: if we restrict nodes (instead of restricting edges) we can use in shortest paths, what will happen?
- Suppose we already restricted shortest paths to use a certain set of nodes and we have all the shortest path calculated. Now, we add an additional node to the node set. All existing shortest paths need to reconsider, if they go to the new node first, then to the target node, will the path be shorter. After all nodes are added to the node set. Shortest paths have been all found.
- Optimal sub-structure: define dp[k][x][y] as the shortest path between x and y nodes by using first k nodes, dp[k][x][y] = min(dp[k-1][x][k]+dp[k-1][k][y]), for k in all nodes
// Initialization: we manually define the case
// where we don't any nodes in shortest paths
for (int i=0;i<N;++i) {
dp[i][i] = 0;
}
for (auto [a, b, w]:Edges) {
dp[a][b] = w;
}
// Using first k nodes in the solution
for (int k=0;k<N;++k) {
for (int i=0;i<N;++i) {
for (int j=0;j<N;++j) {
if (dp[i][k]==INT_MAX || dp[k][j]==INT_MAX)
continue;
dp[i][j] = std::min(dp[i][j], dp[i][k]+dp[k][j]);
}
}
}
// Detect negative loop. If dp[i][i]<0, it means
// node i is on a negative loop. If dp[i][i] >= 0,
// node i is not on a negative loop, but it might
// be able to reach a negative loop.
for (int i=0;i<N;++i) {
if (dp[i][i]<0) {
is_negative_loop = true;
}
}
Dijkstra Algorithm
- Algorithms above are DP methods. Is it possible to have greedy algorithms if we add some constrain?
- If we know one path is the shortest path between its two nodes, can we derive the next shortest path by greedy optimizing some local objective? For example, can I just check all the possible path extended from the existing path, and pick the one with smallest summed weight as the next shortest path? To achieve this, we shouldn’t allow a path first grow larger and then later come back with a smaller summed weight. This means we don’t allow negative edges showing up in the graph. Once we pick a path as shortest path, all extended paths from it will have a larger summed weight.
- To formalize, we maintain a priority queue for all candidate shortest paths. Each time, we take out the smallest weighted path as the shortest path and add newly extended paths into the priority queue as new candidates.
// <dst node, src-dst path weight>
using ele = std::pair<int, int>;
auto comp_fn = [](const ele& l, const ele& r){
return l.second > r. second;
};
std::priority_queue<ele, std::vector<ele>, decltype(comp_fn)> pq(comp_fn);
std::vector<bool> vis(N, false);
pq.emplace(src, 0);
while (!pq.empty()) {
auto [cur, cur_w] = pq.top(); pq.pop();
if (!vis[cur]) continue;
vis[cur] = true;
dis[cur] = cur_w;
for (auto next:graph[cur]) {
if (vis[next]) continue;
pq.emplace(next, cur_w+weight[cur][next]);
}
}
Others
- A better shortest path algorithm for all source node: Johnson’s algorithm, O(NM*logM)
Given a weighted non-directed graph, find its minimum span tree. Select n-1 edges that connect all n nodes, which have the minimum weight in total.
Prim Algorithm
Prim algorithm resembles Dijkstra algorithm.
- Dijkstra algorithm: greedy select the node who has the shortest path from source node
- Prim algorithm: greedy select the node who has the smallest weight to connect to current MST
// <dst node, edge weight>
using ele = std::pair<int, int>;
auto comp_fn = [](const ele& l, const ele& r){
return l.second > r. second;
};
std::priority_queue<ele, std::vector<ele>, decltype(comp_fn)> pq(comp_fn);
std::vector<bool> vis(N, false);
pq.emplace(src, 0);
int total_w = 0;
while (!pq.empty()) {
auto [cur, cur_w] = pq.top(); pq.pop();
if (!vis[cur]) continue;
vis[cur] = true;
total_w += cur_w;
for (auto next:graph[cur]) {
if (vis[next]) continue;
pq.emplace(next, weight[cur][next]);
}
}
Kruskal Algorithm
Prim algorithm is greedy selecting the minimal weighted edge locally. Kruskal will select the minimal weighted edge globally, but using union find data structure to check connectivity in O(log N) time.
std::vector<int> uf(n);
std::vector<int> rank(n, 1);
// Initialization.
for (int i=0;i<n;++i) uf[i] = i;
// Find function.
int Find(int x) {
while (x!=uf[x]) x=uf[x];
return x;
}
// Union function.
void Union(int x, int y) {
int rootx = Find(x), rankx = rank[rootx];
int rooty = Find(y), ranky = rank[rooty];
if (rankx < ranky) {
uf[rootx] = rooty;
} else if (rankx > ranky) {
uf[rooty] = rootx;
} else {
uf[rootx] = rooty;
rank[rooty]++;
}
}
// <weight, x, y>
using edge = std::tuple<int, int, int>;
// Kruskal algorithm
std:vector<edge> edges;
sort(edges.begin(), edges.end());
int total_w = 0;
for (auto [w, x, y]:edges) {
if (Find(x) != Find(y)) {
Union(x, y);
total_w += w;
}
}