Time-Series Data Engine for Market Orders

This is a simplfied implementation of a data warehouse to support efficient researches and simulations with large data volumes, written using C++17 with no external libraries. The internal implementation of data compression and structure for efficient time-based queries take heavy inspiration from other popular time-series database soultions: InfluxDB and Prometheus.

Compiler and architecure support

This project uses CMake, and have been tested for stable run on MacOS and Linux(Ubuntu). Every commit in main is tested with GCC on armv6, armv7, aarch64, riscv64, s390x, and ppc64le. Require C++17 for the usage of std::filesystem Extensive usage of C++11 and C++14 features for effcient codebase

Quick Start

First, clone and build

# Whereever you cloned this codebase to
$ cmake .
$ make

Then, start the interactive engine client

$ ./data_engine

On startup, you will be ask to input a path for the persistence storage. Simply type 0 to use the default location, which is in the local directory.

Engine Client Usage

This will give the basic instructions on how to test this client implementations

Insertion

[Load from file for a single symbol]
        LOAD <file_path> <symbol>

[Insert a single order]
        INSERT <symbol> <epoch> <id> <side:BUY/SELL> <category:NEW/TRADE/CANCEL> <price> <quantity>

[insert one order into database - use engine directly for file ingestions]
        INSERT <symbol> AT <epoch> VALUES <id> <side:BUY/SELL> <category:NEW/TRADE/CANCEL> <price> <quantity>

[delete order by epoch-id pair for a symbol]
        DELETE <symbol> WITH <epoch> <id>

[update order by epoch-id pair for a symbol with other values]
        UPDATE <symbol> WITH <epoch> <id> VALUES <side:BUY/SELL> <category:NEW/TRADE/CANCEL> <price> <quantity>

Query

The engine client currently supports a variety of query methods

[Query single symbol - At only 1 timestamp]
        FROM <symbol> AT <epoch> QUERY <data>

[Query single symbol - Within a range and granularity]
        FROM <symbol> RANGE <start> <end> <granularity> QUERY <data>

[Query multiple symbols - At only 1 timestamp]
        FROM_MULTIPLE <symbol_1> <symbol_2> ... <symbol_n> AT <epoch> QUERY <data>

[Query multiple symbols - Within a range and granularity]
        FROM_MULTIPLE <symbol_1> <symbol_2> ... <symbol_n> RANGE <start> <end> <granularity> QUERY <data>

Each result line will always contains the symbol and epoch Data fields supported for fine-grained queries: buy1, buy2, buy3, buy4, buy5, sell1, sell2, sell3, sell4, sell5, last_trade

To query all, type ALL after QUERY, for example: FROM SCH AT 1609722840752518773 QUERY ALL

Delete/ Update

For this implementation, I don't support fine-grained delete and update of order rows, as there is no efficient way for me to efficiently re-compressed the data after update/delete

Other Commands

[Exit the engine client gracefully]
        QUIT

[Delete all data inside the storage]
        NUKE_DB

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.github/workflows		.github/workflows
docs		docs
include		include
src		src
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Time-Series Data Engine for Market Orders

Contents

Compiler and architecure support

Quick Start

Engine Client Usage

Insertion

Query

Delete/ Update

Other Commands

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

jushg/market_time_series_db

Folders and files

Latest commit

History

Repository files navigation

Time-Series Data Engine for Market Orders

Contents

Compiler and architecure support

Quick Start

Engine Client Usage

Insertion

Query

Delete/ Update

Other Commands

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages