kSPA - The k-Shortest Paths Algorithms Go package

Base interface

All kSP algorithms implement the Searcher interface:

	TopK(g *MultiGraph, srcId int, targetId int, topK int) (res PriorityQueue)
	TopKOneToOne(g *MultiGraph, srcIds []int, targetIds []int, topK int) (res []PriorityQueue)
	TopKOneToMany(g *MultiGraph, srcIds []int, targetIds []int, topK int) (res []PriorityQueue)

where

g - pointer to the directed cyclic graph (DCG) MultiGraph,
srcId - the start point Id of desired paths,
srcIds - array of start points,
targetId - the end point Id of desired paths,
targetIds - array of end points,
topK - desired count of paths.

The TopK function returns a PriorityQueue object with the top-k paths from a single source to a single target, i.e. the k paths with the smallest weights (path weight → min).

The TopKOneToOne function returns PriorityQueue objects with the top-k paths for every pair single source → single target, where the source and the target are stored at corresponding positions of the arrays srcIds and targetIds.

The TopKOneToMany function returns PriorityQueue objects with the top-k paths for every pair single source → any target, where the source is stored at the i-th position of the array srcIds and the target at any position of targetIds.
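
The two batch calls differ only in how sources and targets are paired. The sketch below enumerates the (source, target) pairs that each call is described as covering. The names pair, oneToOnePairs and oneToManyPairs are hypothetical helpers written for this illustration and are not part of the package; how the package treats arrays of unequal length is an assumption.

```go
package main

import "fmt"

// pair is a hypothetical (source, target) query, not a package type.
type pair struct{ src, target int }

// oneToOnePairs pairs srcIds[i] with targetIds[i], as TopKOneToOne is described.
// Assumption: extra elements of the longer slice are ignored.
func oneToOnePairs(srcIds, targetIds []int) []pair {
	n := len(srcIds)
	if len(targetIds) < n {
		n = len(targetIds)
	}
	out := make([]pair, 0, n)
	for i := 0; i < n; i++ {
		out = append(out, pair{srcIds[i], targetIds[i]})
	}
	return out
}

// oneToManyPairs pairs every source with every target, as TopKOneToMany is described.
func oneToManyPairs(srcIds, targetIds []int) []pair {
	out := make([]pair, 0, len(srcIds)*len(targetIds))
	for _, s := range srcIds {
		for _, t := range targetIds {
			out = append(out, pair{s, t})
		}
	}
	return out
}

func main() {
	src, dst := []int{1, 2}, []int{7, 8}
	fmt.Println(oneToOnePairs(src, dst))  // [{1 7} {2 8}]
	fmt.Println(oneToManyPairs(src, dst)) // [{1 7} {1 8} {2 7} {2 8}]
}
```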

All subroutines shown above use a MultiGraph object as the storage of the directed cyclic graph.

Directed Cyclic Graph

The MultiGraph object supports several operations on the DCG:

func (g *MultiGraph) Build(ent EntitySeq)
func (g *MultiGraph) Pred(v int) MEdgeSeq
func (g *MultiGraph) Succ(u int) MEdgeSeq
func (g *MultiGraph) UpdateRelation(ent EntitySeq) error
func (g *MultiGraph) GetEdgeIndex(id1, id2 int) (int, bool)

where

Build - builds DCG from the Entities array,
Pred - returns all predecessors of the vertex with internal id v,
Succ - returns all successors of the vertex with internal id u,
UpdateRelation - updates the DCG from the Entities array; returns an error if the ent array contains new edges,
GetEdgeIndex - returns the internal index of the MultiEdge,
id1, id2 - ids of vertices from the Entity object,
ent - EntitySeq array of Entity-objects.

NOTE

To get the internal id of a vertex, use the g.VertexIndex map of the MultiGraph.

Input data

The package provides subroutines for importing entities from and exporting them to JSON files:

func FromJsonFile(fn string) (seq EntitySeq)
func ToJsonFile(fn string, seq EntitySeq)

An entity object must have the structure shown below:

type EntityRaw struct {
	EntityId string `json:"EntityId"`
	Id1      int    `json:"Id1"`
	Id2      int    `json:"Id2"`
	Relation string `json:"Relation"`
}

where

EntityId - unique id of the edge,
Id1, Id2 - non-unique ids of vertices,
Relation - value of the Entity cost.
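
As an illustration of this structure, the sketch below decodes a small in-memory JSON document with the standard encoding/json package. The helper parseEntities is hypothetical (the package itself reads files through FromJsonFile), the sample values are made up, and the assumption that the input is a JSON array of such objects is not confirmed by this README.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// EntityRaw mirrors the structure shown above.
type EntityRaw struct {
	EntityId string `json:"EntityId"`
	Id1      int    `json:"Id1"`
	Id2      int    `json:"Id2"`
	Relation string `json:"Relation"`
}

// parseEntities is a hypothetical helper, not a package function.
// Assumption: the input is a JSON array of EntityRaw objects.
func parseEntities(data []byte) ([]EntityRaw, error) {
	var out []EntityRaw
	if err := json.Unmarshal(data, &out); err != nil {
		return nil, err
	}
	return out, nil
}

func main() {
	// Made-up sample: two parallel edges 1 -> 2 and one edge 2 -> 3.
	sample := []byte(`[
		{"EntityId": "e-1", "Id1": 1, "Id2": 2, "Relation": "1.25"},
		{"EntityId": "e-2", "Id1": 1, "Id2": 2, "Relation": "0.80"},
		{"EntityId": "e-3", "Id1": 2, "Id2": 3, "Relation": "2.00"}
	]`)
	entities, err := parseEntities(sample)
	if err != nil {
		panic(err)
	}
	for _, e := range entities {
		fmt.Printf("%s: %d -> %d (relation %s)\n", e.EntityId, e.Id1, e.Id2, e.Relation)
	}
}
```

Note that Id1 and Id2 repeat across entities (several edges may share the same endpoints), while EntityId identifies each edge.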

Operations with DCG

Reading and building graph from JSON file:

	basePath := "./examples"

	graph := new(MultiGraph)
	entities := FromJsonFile(path.Join(basePath, "small.json"))
	graph.Build(entities)

Getting the internal edge index:

    entity := Entity{Id1: 1, Id2: 2}
    index, getIndexOk := graph.GetEdgeIndex(entity.Id1, entity.Id2)

Getting predecessors and successors of a vertex:

    entity := Entity{Id1: 1, Id2: 2}
    u := graph.VertexIndex[entity.Id1]
    v := graph.VertexIndex[entity.Id2]
    s := graph.Succ(u)
    p := graph.Pred(v)

Updating edge costs:

    updates := FromJsonFile(path.Join(basePath, "small_update.json"))
    err := graph.UpdateRelation(updates)

    if err != nil {
        panic(fmt.Errorf("UpdateRelation() error = %v", err))
    }

For more examples of using the DCG object, see here.

Algorithms

The package includes several algorithms:

1. Depth-First Search with Memoization | source
2. Recursive Depth-First Search | source
3. Iterative Depth-First Search | source
4. Floyd-Warshall | source
5. Bellman-Ford | source

NOTE

Algorithms 2-5 are not fully implemented. They are part of the experimental interfaces of the package. Please use them with care.

Depth-First Search with Memoization

This is a variant of the ID-DFS (iterative deepening depth-first search) algorithm. The current implementation has several modifications and differences from the original algorithm:

the search depth is limited by a concrete value instead of positive infinity in the original algorithm,
memoization is used,
multiple edges with the same source and target are reduced to the optimal edge with the least weight,
weights are calculated as -math.Log(entity.Relation),
weight boundaries and a PriorityQueue are used.
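
The edge reduction and the weight transform can be sketched as follows. This is a minimal illustration, not the package code: the edge type and the functions weight and reduce are hypothetical, the relation is assumed to be already parsed into a positive float64, and the sample values are made up. The sketch also assumes that relations multiply along a path; since log(a*b) = log(a) + log(b), minimizing the sum of -log(relation) is then the same as maximizing the product of relations.

```go
package main

import (
	"fmt"
	"math"
)

// edge is a hypothetical parallel edge with a positive relation value.
type edge struct {
	id       string
	u, v     int
	relation float64
}

// weight turns a relation into an additive weight: larger relation, smaller weight.
func weight(relation float64) float64 { return -math.Log(relation) }

// reduce keeps, for every (u, v) pair, the edge with the least weight.
func reduce(edges []edge) map[[2]int]edge {
	best := make(map[[2]int]edge)
	for _, e := range edges {
		k := [2]int{e.u, e.v}
		cur, ok := best[k]
		if !ok || weight(e.relation) < weight(cur.relation) {
			best[k] = e
		}
	}
	return best
}

func main() {
	edges := []edge{
		{"e-1", 1, 2, 1.25},
		{"e-2", 1, 2, 0.80},
		{"e-3", 2, 3, 2.00},
	}
	for k, e := range reduce(edges) {
		fmt.Printf("%v: %s weight=%.4f\n", k, e.id, weight(e.relation))
	}
}
```

Here e-1 wins over e-2 for the pair (1, 2) because its larger relation gives the smaller weight.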

The algorithm consists of the following steps:

Reduce multiple edges with the same source and target to the optimal edge with the least weight, calculated as -math.Log(entity.Relation).
Build the Depth-First Search tree using DFS with memoization and the depth limited by an optimal value:

    DFS(source, target, level) returns nodes, stat:
        if inMemo(source, target, level):
            return getMemo(source, target, level)

        stat = {min, max, mean, mean2, pathsCount}
        nodes = array of TreeNode

        for edge as (u,v) in successors(source):
            if target == v:
                appendTo(nodes, NewTreeNode(edge, "endpoint"))
                calcStatistics(stat, edge)
                continue

            appendTo(nodes, 
                ExpandTreeNodes(edge,DFS(v, target, level+1)))
            calcStatistics(stat, edge)

        setMemo(source, target, level, nodes, stat)
        return nodes, stat
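
A simplified, runnable sketch of this step is given below. It is not the package implementation: instead of TreeNode lists and the full statistics it records only the number of paths and the minimal weight per memo entry, and the graph, stat and key types, as well as the sample graph, are hypothetical.

```go
package main

import (
	"fmt"
	"math"
)

// graph maps a vertex to its outgoing edges (neighbour -> weight); hypothetical layout.
type graph map[int]map[int]float64

// stat summarises all paths to the target found below one (vertex, level) state.
type stat struct {
	paths int
	min   float64
}

// key identifies a memo entry; the target and the limit are fixed per search.
type key struct{ v, level int }

// dfs returns statistics over all walks from v to target that use at most
// limit-level further edges. Results are memoized per (vertex, level), so a
// vertex reached again at the same level is not expanded twice.
func dfs(g graph, v, target, level, limit int, memo map[key]stat) stat {
	k := key{v, level}
	if s, ok := memo[k]; ok {
		return s
	}
	s := stat{min: math.Inf(1)}
	if level < limit {
		for next, w := range g[v] {
			if next == target {
				s.paths++
				s.min = math.Min(s.min, w)
				continue // endpoint: do not search past the target
			}
			sub := dfs(g, next, target, level+1, limit, memo)
			s.paths += sub.paths
			s.min = math.Min(s.min, w+sub.min)
		}
	}
	memo[k] = s
	return s
}

func main() {
	// Made-up graph with a loop 1 -> 2 -> 1; the target is vertex 4.
	g := graph{
		1: {2: 1, 3: 4},
		2: {3: 1, 4: 5, 1: 0.5},
		3: {4: 1},
	}
	s := dfs(g, 1, 4, 0, 3, map[key]stat{})
	fmt.Printf("paths=%d min=%.1f\n", s.paths, s.min) // paths=3 min=3.0
}
```

Because the level grows on every recursive call and is capped by the limit, the search terminates even though the graph contains a cycle.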

For every TreeNode, the MIN(x), MAX(x), E(x) and E(x**2) statistics of the weight x are computed and stored. These statistics are used to skip non-optimal branches and thus reduce the search area.

Calculate the threshold according to the preinitialized mode; the threshold reduces the time complexity of the algorithm:

	stat = getMemo(source, target, level) // E(x), E(x**2), pathsCount
	switch mode {
	case THR_ZERO:
		threshold = 0
	case THR_MEAN:
		threshold = E(x)
	case THR_MEAN_STDDEV:
		threshold = E(x) - sqrt((E(x**2)-E(x) ** 2)/pathsCount)
	}

The user can also set a custom threshold value.
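
The threshold rule can be written as a small Go function. This is a sketch under assumptions: the mode names follow the pseudocode above, but the Mode type, the integer values of the constants and the function threshold are hypothetical, and pathsCount is assumed to be greater than zero. The THR_MEAN_STDDEV case has the form of the mean minus the standard error of the mean.

```go
package main

import (
	"fmt"
	"math"
)

// Mode names follow the pseudocode above; the integer values are arbitrary.
type Mode int

const (
	THR_ZERO Mode = iota
	THR_MEAN
	THR_MEAN_STDDEV
)

// threshold takes the stored moments E(x) and E(x**2) and the number of paths.
// Assumption: pathsCount > 0.
func threshold(mode Mode, mean, mean2 float64, pathsCount int) float64 {
	switch mode {
	case THR_MEAN:
		return mean
	case THR_MEAN_STDDEV:
		variance := math.Max(mean2-mean*mean, 0) // guard against rounding below zero
		return mean - math.Sqrt(variance/float64(pathsCount))
	default: // THR_ZERO
		return 0
	}
}

func main() {
	// Made-up weights -1, -3, -1, -3: E(x) = -2, E(x**2) = 5, pathsCount = 4.
	fmt.Println(threshold(THR_ZERO, -2, 5, 4))        // 0
	fmt.Println(threshold(THR_MEAN, -2, 5, 4))        // -2
	fmt.Println(threshold(THR_MEAN_STDDEV, -2, 5, 4)) // -2.5
}
```

A lower (more negative) threshold prunes more branches during tracing, at the cost of possibly returning fewer than k paths.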

Trace the memo object using the original DFS algorithm with constraints:

    pq = priority queue

    TRACE(src int, target int, level int):
        if src < 0 or target < 0 or level < 0:
            return

        nodes, stat = getMemo(src, target, level)

        // psa[level] is the accumulated weight of the path prefix;
        // skip the branch if even its best continuation misses the threshold
        if psa[level] + stat.minWeight >= threshold:
            return

        // skip the branch if it cannot beat the worst path kept in pq
        if maxWeight != MIN_WEIGHT and
            psa[level] + stat.minWeight > maxWeight:
            return

        for node in nodes:
            edges[level] = node.base
            weight = psa[level] + node.base.weight
            psa[level+1] = weight

            // no further node to expand: the path is complete
            if node.src < 0:
                if weight >= 0:
                    continue

                if weight < maxWeight:
                    set(pq, edges, weight)
                    maxWeight = pq[0].priority

                continue

            TRACE(node.src, node.target, node.level)
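
The role of pq and maxWeight in the trace can be illustrated with a bounded queue built on the standard container/heap package. This is a sketch, not the package's PriorityQueue: the item and maxHeap types and the functions offer and sorted are hypothetical. The heap keeps the worst (largest) weight at index 0, so the current bound can be read from the root and the worst path can be replaced when a better one arrives.

```go
package main

import (
	"container/heap"
	"fmt"
	"sort"
)

// item is a hypothetical queue entry: a path and its total weight.
type item struct {
	path   []string
	weight float64
}

// maxHeap keeps the worst (largest) weight at index 0.
type maxHeap []item

func (h maxHeap) Len() int            { return len(h) }
func (h maxHeap) Less(i, j int) bool  { return h[i].weight > h[j].weight }
func (h maxHeap) Swap(i, j int)       { h[i], h[j] = h[j], h[i] }
func (h *maxHeap) Push(x interface{}) { *h = append(*h, x.(item)) }
func (h *maxHeap) Pop() interface{} {
	old := *h
	n := len(old)
	x := old[n-1]
	*h = old[:n-1]
	return x
}

// offer adds it while fewer than k items are kept; afterwards it replaces
// the current worst item only if it has a strictly smaller weight.
func offer(h *maxHeap, k int, it item) {
	if h.Len() < k {
		heap.Push(h, it)
		return
	}
	if it.weight < (*h)[0].weight {
		(*h)[0] = it
		heap.Fix(h, 0)
	}
}

// sorted returns the kept weights in ascending order (best first).
func sorted(h maxHeap) []float64 {
	ws := make([]float64, 0, len(h))
	for _, it := range h {
		ws = append(ws, it.weight)
	}
	sort.Float64s(ws)
	return ws
}

func main() {
	h := &maxHeap{}
	for _, w := range []float64{-0.2, -1.5, -0.7, -3.0, -0.1} {
		offer(h, 3, item{weight: w})
	}
	// Keeps -3, -1.5 and -0.7; the bound for new candidates is -0.7.
	fmt.Println(sorted(*h), "bound:", (*h)[0].weight)
}
```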

Process multiple edges with the same source and target for every optimal edge from pq. This step is valid because every path that includes a non-optimal edge is worse than or equal to the path that includes only optimal edges. For more information see ProcessOutsideEdges.

Launch Depth-First Search methods

The package provides several factory interfaces for creating Searcher-compatible objects:

func NewDfs(name string, deepLimit int) (Searcher, error)
func DfsDo(st Searcher, op string, g *MultiGraph, srcIds []int, targetIds []int, topK int) (pathsb []byte, err error)
func NewSearcher(major string, minor string) (Searcher, error)

where

major - the algorithm's family, e.g. "dfs",
minor, name - particular algorithm, e.g. "colored", "stacked", "memo",
deepLimit - depth of searching limited by this value,
op - one of the "TopK", "TopKOneToOne", "TopKOneToMany".

Benchmarks

The benchmarks below were run for the Depth-First Search with Memoization algorithm with deepLimit=[5, 6]. For details see this.

NOTE

All values in the tables are given in milliseconds unless another unit is stated.
Testing environment - os: linux, arch: amd64, cpu: Intel(R) Core(TM) i7-8750H CPU @ 2.20GHz.
Test data was generated by the GenerateRandomEntitiesJson subroutine.
q25, q50, q75 - the 25th, 50th and 75th percentiles, respectively.

Table 1 - Depth-First Search with Memoization with deepLimit=5

#	fn	mean	min	max	q25	q50	q75
0	TopK	8.4	4.3	22	6.8	7.1	7.5
1	TopKOneToMany	33	6.0	64	17	39	48
2	TopKOneToOne	71	6.9	160	35	69	100
3	UpdateRelation	< 1 ns	< 1 ns	< 1 ns	< 1 ns	< 1 ns	< 1 ns

Table 2 - Depth-First Search with Memoization with deepLimit=6

#	fn	mean	min	max	q25	q50	q75
0	TopK	17	8.9	44	13	14	15
1	TopKOneToMany	71	13	180	29	51	110
2	TopKOneToOne	140	80	320	72	130	190
3	UpdateRelation	< 1 ns	< 1 ns	< 1 ns	< 1 ns	< 1 ns	< 1 ns

Analysis

With the depth equal to 5 and top-100 paths, the working time of all subroutines does not exceed 160 ms. The maximum working time was observed for the TopKOneToOne subroutine with an input array srcIds of length 15.

With the depth equal to 6 and top-100 paths, the working time of all subroutines does not exceed 320 ms. The maximum working time was again observed for the TopKOneToOne subroutine with an input array srcIds of length 15. For an input array srcIds of length 10, the maximum working time is about 200 ms.

With both depth values (5 and 6), the working time of the UpdateRelation subroutine is less than 1 ns for updating 20000 edges in the graph structure. This is possible due to the internal index maps inside the DCG and MultiEdge objects.

With depths equal to or greater than 5, paths with internal loops may exist, e.g. the chain of length 8:

entity-7023 -> entity-1305-b -> entity-291-b -> entity-1992-b -> entity-6958 -> entity-1305-b -> entity-291-b -> entity-18-b

has a negative internal loop from the vertex with id 15 back to the same vertex:

entity-1305-b -> entity-291-b -> entity-1992-b -> entity-6958

This path is the best one, with the maximum relation value, and the Depth-First Search with Memoization algorithm is able to find it.
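
Why such a loop is called negative follows from the weight transform -math.Log(entity.Relation): assuming relations multiply along a path, a loop whose relations multiply to more than 1 has a negative total weight, so traversing it improves the path. The sketch below checks this for made-up relation values; cycleWeight is a hypothetical helper, and the numbers are not taken from the benchmark data.

```go
package main

import (
	"fmt"
	"math"
)

// cycleWeight sums -log(relation) over the edges of a loop.
func cycleWeight(relations []float64) float64 {
	total := 0.0
	for _, r := range relations {
		total += -math.Log(r)
	}
	return total
}

func main() {
	loop := []float64{1.1, 1.2, 0.9, 1.05} // made-up relation values, product > 1
	w := cycleWeight(loop)
	fmt.Printf("loop weight = %.4f, negative = %v\n", w, w < 0)
}
```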

Comparing the results from this and that, it is obvious that the Depth-First Search with Memoization algorithm finds more paths with greater cost than the classic Recursive and Iterative Depth-First Search algorithms, which skip visited vertices.

Conclusion

The Depth-First Search with Memoization algorithm is appropriate for systems with rigorous time constraints and large graphs (e.g. > 5000 vertices and > 20000 edges).
For best performance, find a compromise between the depth and the size of the top-k sequence (e.g. for depth=8 use a top-10 sequence, for depth=5 use a top-100 sequence, for depth=6 use a top-50 or shorter sequence).
For dense graphs, the performance of the package subroutines may be worse.
Choose the threshold parameter, or the algorithm for computing it, according to the density of the Relation parameter.
The Depth-First Search with Memoization algorithm finds more paths with greater cost than the classic Recursive and Iterative Depth-First Search algorithms, which skip visited vertices.
Algorithms such as Bellman-Ford and Floyd-Warshall are not appropriate for a DCG with a large number of vertices and edges due to low performance (more than several seconds for the full paths task). Besides, the Bellman-Ford and Floyd-Warshall algorithms do not work correctly with nested negative cycles.
