repair-node

An academic attempt at implementing the Re-Pair (recursive pairing) algorithm.

The goal of this algorithm is to process a file of words (each represented by, or uniquely converted into, an integer) and to find the most frequently occurring pairs of words and phrases, so that the document can be partitioned for more efficient indexing across multiple versions.

Currently, this code can process an array of integers, e.g. [2,5,6,4,3,7,1,4], and build the replacement pair table, which can be used to construct a tree for partitioning. It can also expand a pair table back into the original document in order to verify correctness.
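As an illustration of the round trip described above, here is a minimal sketch of building a pair table and expanding it again. It is not the code in process.js: the names mostFrequentPair, buildPairTable and expand are hypothetical, the input is assumed to contain only non-negative integers, and the pair counting is deliberately naive (it counts overlapping occurrences).

```javascript
// Hypothetical sketch; none of these names come from process.js.

// Return the most frequent adjacent pair, or null if no pair occurs twice.
function mostFrequentPair(seq) {
  const counts = new Map();
  let best = null;
  let bestCount = 1;
  for (let i = 0; i + 1 < seq.length; i++) {
    const key = seq[i] + ',' + seq[i + 1];
    const count = (counts.get(key) || 0) + 1;
    counts.set(key, count);
    if (count > bestCount) {
      best = [seq[i], seq[i + 1]];
      bestCount = count;
    }
  }
  return best;
}

// Repeatedly replace the most frequent pair with a fresh symbol,
// recording each replacement in the pair table.
function buildPairTable(input) {
  let seq = input.slice();
  let nextSymbol = Math.max(...seq) + 1; // assumes integer symbols
  const table = [];
  let pair;
  while ((pair = mostFrequentPair(seq)) !== null) {
    const out = [];
    for (let i = 0; i < seq.length; i++) {
      if (i + 1 < seq.length && seq[i] === pair[0] && seq[i + 1] === pair[1]) {
        out.push(nextSymbol);
        i++; // skip the second half of the replaced pair
      } else {
        out.push(seq[i]);
      }
    }
    table.push({ symbol: nextSymbol, pair: pair });
    nextSymbol++;
    seq = out;
  }
  return { seq: seq, table: table };
}

// Undo the replacements, newest first, to recover the original array.
function expand(seq, table) {
  let out = seq.slice();
  for (let t = table.length - 1; t >= 0; t--) {
    const entry = table[t];
    out = out.flatMap((s) => (s === entry.symbol ? entry.pair : [s]));
  }
  return out;
}

const compressed = buildPairTable([1, 2, 1, 2, 3, 1, 2]);
console.log(compressed.seq); // [ 4, 4, 3, 4 ]
console.log(expand(compressed.seq, compressed.table)); // [ 1, 2, 1, 2, 3, 1, 2 ]
```

The example array from the paragraph above, [2,5,6,4,3,7,1,4], contains no repeated adjacent pair, so this sketch would return it unchanged with an empty table.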

To Run

Requires node to be installed.
Run npm install to fetch the one external package used, underscore (declared in package.json).
Run node process.js or npm start

Notes & Todo

The most inefficient part is currently the pair scan, particularly the indexSearch() function. It iterates linearly through an array of pair objects and is itself called inside a for loop, so the running time grows roughly quadratically with the input size. A faster lookup needs to be implemented.
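One possible direction for the faster lookup, sketched under assumptions: instead of searching an array of pair objects, the pairs could be kept in a Map keyed by a string built from the two integers, so each lookup takes constant time on average. The names pairKey and countPairs and the entry shape below are hypothetical and do not reflect how indexSearch() or process.js are actually written.

```javascript
// Hypothetical sketch; pairKey and countPairs are illustrative names only.

// Build a unique string key for a pair of integers.
function pairKey(a, b) {
  return a + ',' + b;
}

// Scan the sequence once, recording each adjacent pair's count and positions.
function countPairs(seq) {
  const index = new Map(); // key -> { pair, count, positions }
  for (let i = 0; i + 1 < seq.length; i++) {
    const key = pairKey(seq[i], seq[i + 1]);
    let entry = index.get(key);
    if (entry === undefined) {
      entry = { pair: [seq[i], seq[i + 1]], count: 0, positions: [] };
      index.set(key, entry);
    }
    entry.count++;
    entry.positions.push(i);
  }
  return index;
}

const pairIndex = countPairs([1, 2, 1, 2, 3]);
console.log(pairIndex.get(pairKey(1, 2)).count); // 2
console.log(pairIndex.get(pairKey(1, 2)).positions); // [ 0, 2 ]
```

With this layout the whole scan is a single pass over the input, and the recorded positions could later be used to avoid rescanning the array when a pair is replaced.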
