
Turbo-Geth

Turbo-Geth is a fork of Go-Ethereum with a focus on performance.

NB! In-depth links are marked by the microscope sign (🔬)

Disclaimer: this software is currently a tech preview. We will do our best to keep it stable and make no breaking changes, but we don't guarantee anything. Things can and will break.

The current version is based on Go-Ethereum 1.10.1.

System Requirements

Recommended: 2TB of storage space on a single partition: 1TB for state, 200GB for temp files (you can symlink or mount the folder <datadir>/etl-tmp to another disk).

RAM: 16GB, 64-bit architecture, Golang version >= 1.15.6 (https://golang.org/doc/install)

🔬 More info on disk storage is here.

Usage

Getting Started

> git clone --recurse-submodules -j8 https://github.com/ledgerwatch/turbo-geth.git
> cd turbo-geth
> make tg
> ./build/bin/tg

Testnets

If you would like to give turbo-geth a try but do not have a spare 2TB on your drive, a good option is to start syncing one of the public testnets, Görli. It syncs much quicker and does not take up as much disk space:

> git clone --recurse-submodules -j8 https://github.com/ledgerwatch/turbo-geth.git
> cd turbo-geth
> make tg
> ./build/bin/tg --datadir goerli --goerli

Please note the --datadir option, which allows you to store turbo-geth files in a non-default location; in this example, in the goerli subdirectory of the current directory.

Windows

Windows users may run turbo-geth in 3 possible ways:

  • Build tg binaries natively for Windows : this method is possible, but we still lack a fully automated build process, so at the moment it is not the preferred option. There is also a caveat that may make running TG natively on Windows uncomfortable: data file allocation is fixed, so you need to know in advance how much space to allocate for the database file, using the option --lmdb.mapSize

  • Use Docker : see docker-compose.yml

  • Use WSL (Windows Subsystem for Linux) : You can easily install WSL by following this quickstart guide. Reading about how interoperability between Windows and Linux works is also suggested. Once your WSL environment is ready with your preferred Linux distribution (for this document we assume you've chosen Ubuntu), proceed to install (in the Linux subsystem) the required components:

> sudo apt install build-essential git golang golang-go

Once this last step is completed, you can run tg as if you were on Linux, as described in the Usage section.

Note : The WSL native filesystem resides on the same partition as the Windows system partition (usually C:). Unless this is the only partition on your system, it is advisable to have TG store its data on a different partition. Say your Windows system has a secondary partition D:. The WSL environment sees this partition as /mnt/d, so to have TG store its data there you will have to launch TG as

> ./tg --datadir /mnt/d/[<optional-subfolder>/]
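The drive-letter mapping described above can be sketched in a few lines of Go. This is only an illustration: wslPath is a hypothetical helper, not part of turbo-geth, and it assumes that WSL mounts Windows drives under the default /mnt root.

```go
package main

import (
	"fmt"
	"strings"
)

// wslPath converts a Windows path such as `D:\tg-data` into the path under
// which WSL exposes it, assuming the default automount root /mnt.
// Hypothetical helper for illustration only; not part of turbo-geth.
func wslPath(win string) (string, error) {
	if len(win) < 2 || win[1] != ':' {
		return "", fmt.Errorf("expected a path starting with a drive letter, got %q", win)
	}
	c := win[0]
	if !(c >= 'a' && c <= 'z') && !(c >= 'A' && c <= 'Z') {
		return "", fmt.Errorf("invalid drive letter in %q", win)
	}
	drive := strings.ToLower(win[:1])
	// Windows separators become forward slashes; stray slashes are trimmed.
	rest := strings.Trim(strings.ReplaceAll(win[2:], `\`, "/"), "/")
	if rest == "" {
		return "/mnt/" + drive, nil
	}
	return "/mnt/" + drive + "/" + rest, nil
}

func main() {
	p, err := wslPath(`D:\tg-data`)
	if err != nil {
		panic(err)
	}
	// Prints the launch command with the converted data directory.
	fmt.Printf("./tg --datadir %s\n", p)
}
```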

Key features

🔬 See a more detailed overview of functionality and current limitations. It is updated on a recurring basis.

More Efficient State Storage

Flat KV storage. Turbo-Geth uses a key-value database and stores accounts and storage in a simple way.

🔬 See our detailed DB walkthrough here.

Preprocessing. For some operations, turbo-geth uses temporary files to preprocess data before inserting it into the main DB. This reduces write amplification, and DB inserts become orders of magnitude quicker.

🔬 See our detailed ETL explanation here.
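As a rough illustration of why preprocessing helps, the sketch below buffers entries in arrival order, sorts them by key, and only then hands them to the database in ascending key order. It is a simplified in-memory model, not turbo-geth's ETL code: the collector type and its Collect and Load methods are hypothetical names, and the temporary files mentioned above are left out.

```go
package main

import (
	"bytes"
	"fmt"
	"sort"
)

// entry is one key/value pair waiting to be inserted.
type entry struct{ key, value []byte }

// collector buffers entries as they arrive (hypothetical type; a real
// implementation would spill sorted chunks to temporary files).
type collector struct{ buf []entry }

// Collect appends an entry in arrival order, which may be random.
func (c *collector) Collect(k, v []byte) {
	c.buf = append(c.buf, entry{k, v})
}

// Load sorts the buffered entries by key and passes them to insert in
// ascending key order, so the database sees sequential writes.
func (c *collector) Load(insert func(k, v []byte) error) error {
	sort.SliceStable(c.buf, func(i, j int) bool {
		return bytes.Compare(c.buf[i].key, c.buf[j].key) < 0
	})
	for _, e := range c.buf {
		if err := insert(e.key, e.value); err != nil {
			return err
		}
	}
	c.buf = c.buf[:0]
	return nil
}

func main() {
	c := &collector{}
	c.Collect([]byte("c"), []byte("3"))
	c.Collect([]byte("a"), []byte("1"))
	c.Collect([]byte("b"), []byte("2"))
	// The insert callback stands in for a write to the main DB.
	err := c.Load(func(k, v []byte) error {
		fmt.Printf("insert %s=%s\n", k, v)
		return nil
	})
	if err != nil {
		fmt.Println("load failed:", err)
	}
}
```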

Plain state.

Single accounts/state trie. Turbo-Geth uses a single Merkle trie for both accounts and the storage.

Faster Initial Sync

Turbo-Geth uses a rearchitected full sync algorithm from Go-Ethereum that is split into "stages".

🔬 See more detailed explanation in the Staged Sync Readme

It uses the same network primitives and is compatible with regular go-ethereum nodes that are using full sync; you do not need any special sync capabilities for turbo-geth to sync.

When reimagining the full sync, we focused on batching data together and minimizing DB overwrites. That makes it possible to sync Ethereum mainnet in under 2 days if you have a fast enough network connection and an SSD drive.

Examples of stages are:

  • Downloading headers;

  • Downloading block bodies;

  • Executing blocks;

  • Validating root hashes and building intermediate hashes for the state Merkle trie;

  • And more...
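The idea of splitting sync into stages can be sketched as follows. This is a minimal model under stated assumptions, not the actual staged sync code: the stage type, the runStages function, and the progress map are hypothetical, and each stage is assumed to process its whole block range before the next stage starts.

```go
package main

import "fmt"

// stage is one step of the sync pipeline (hypothetical type).
type stage struct {
	name string
	run  func(from, to uint64) error
}

// runStages executes the stages strictly in order. Each stage processes the
// whole range (progress, target] in one batch before the next stage starts,
// and its progress is recorded so a restart can skip finished work.
func runStages(stages []stage, progress map[string]uint64, target uint64) error {
	for _, s := range stages {
		from := progress[s.name]
		if from >= target {
			continue // this stage has already reached the target block
		}
		if err := s.run(from, target); err != nil {
			return fmt.Errorf("stage %s: %w", s.name, err)
		}
		progress[s.name] = target
	}
	return nil
}

func main() {
	names := []string{"headers", "bodies", "execution", "intermediate hashes"}
	var stages []stage
	for _, n := range names {
		n := n // per-iteration copy captured by the closure
		stages = append(stages, stage{name: n, run: func(from, to uint64) error {
			fmt.Printf("%s: blocks %d..%d\n", n, from+1, to)
			return nil
		}})
	}
	progress := map[string]uint64{}
	if err := runStages(stages, progress, 1000); err != nil {
		fmt.Println("sync failed:", err)
	}
}
```

Running one stage over a large batch of blocks, rather than all steps per block, is what allows data to be batched and DB overwrites to be kept low.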

JSON-RPC daemon

In turbo-geth, RPC calls are extracted out of the main binary into a separate daemon. This daemon can use either a local or a remote DB. That means that the RPC daemon doesn't have to run on the same machine as the main turbo-geth binary, or it can run from a snapshot of a database for read-only calls.

🔬 See RPC-Daemon docs

For local DB

This is only possible if the RPC daemon runs on the same computer as turbo-geth. This mode of operation uses shared memory access to the turbo-geth database, which is reported to have better performance than access via a TCP socket (see the "For remote DB" section below).

> make rpcdaemon
> ./build/bin/rpcdaemon --chaindata ~/Library/TurboGeth/tg/chaindata --http.api=eth,debug,net

In this mode, some RPC API methods do not work. Please see the "For dual mode" section below on how to fix that.

For remote DB

This works regardless of whether the RPC daemon is on the same computer as turbo-geth or on a different one. They use a TCP socket connection to pass data between them. To use this mode, run turbo-geth in one terminal window

> ./build/bin/tg --private.api.addr=localhost:9090

Run RPC daemon

> ./build/bin/rpcdaemon --private.api.addr=localhost:9090 --http.api=eth,debug,net

For dual mode

If both the --chaindata and --private.api.addr options are used for the RPC daemon, it works in a "dual" mode. This only works when the RPC daemon is on the same computer as turbo-geth. In this mode, most data transfer from turbo-geth to the RPC daemon happens via shared memory; only certain things (like new header notifications) happen via TCP socket.
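The relationship between the two options and the three modes can be summarised in a small sketch. The function daemonMode is hypothetical and only mirrors the rules described in this section; it is not how rpcdaemon itself parses its flags.

```go
package main

import (
	"errors"
	"fmt"
)

// daemonMode reports which mode the RPC daemon would run in for the given
// values of --chaindata and --private.api.addr (empty means "not set").
// Hypothetical helper that restates the rules from this section.
func daemonMode(chaindata, privateAPIAddr string) (string, error) {
	switch {
	case chaindata != "" && privateAPIAddr != "":
		return "dual", nil // shared memory for data, socket for notifications
	case chaindata != "":
		return "local", nil // shared memory only, same machine required
	case privateAPIAddr != "":
		return "remote", nil // socket only, works across machines
	default:
		return "", errors.New("neither --chaindata nor --private.api.addr is set")
	}
}

func main() {
	mode, err := daemonMode("/path/to/chaindata", "localhost:9090")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println("mode:", mode)
}
```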

Supported JSON-RPC calls (eth, debug, net, web3):

For details on the implementation status of each command, see this table.

Run all components by docker-compose

The following commands start: turbo-geth on port 30303, rpcdaemon on 8545, prometheus on 9090, grafana on 3000

docker-compose build
XDG_DATA_HOME=/preferred/data/folder docker-compose up

Grafana dashboard

docker-compose up prometheus grafana, detailed docs.

Getting in touch

Turbo-Geth Discord Server

The main discussions are happening on our Discord server. To get an invite, send an email to tg [at] torquem.ch with your name, occupation, a brief explanation of why you want to join the Discord, and how you heard about Turbo-Geth.

Reporting security issues/concerns

Send an email to security [at] torquem.ch.

Team

Core contributors:

Thanks to:

  • All contributors of Turbo-Geth

  • All contributors of Go-Ethereum

  • Our special respect and gratitude go to the core team of Go-Ethereum. Keep up the great job!

Happy testing! 🥤

Known issues

htop shows incorrect memory usage

TurboGeth's internal DB (LMDB) uses memory-mapped files (MemoryMap), which means the OS, rather than the application, manages all read, write, and cache operations (linux, windows).

In its RES column, htop shows the memory of "App + OS page cache held for the given App". This is not informative: if htop says that the app is using 90% of memory, you can still run 3 more instances of the app on the same machine, because most of that 90% is OS page cache.
The OS automatically frees this cache any time it needs memory. A smaller page cache size may not impact the performance of TurboGeth at all.

The following tools show the correct memory usage of TurboGeth:

  • vmmap -summary PID | grep -i "Physical footprint". Without grep you can see details: in the MALLOC ZONE section, the Resident Size column shows the App's memory usage; in the REGION TYPE section, the Resident Size column shows the OS page cache size.
  • The Prometheus dashboard shows the memory of the Go app without the OS page cache (make prometheus, open localhost:3000 in a browser, credentials admin/admin)
  • cat /proc/<PID>/smaps

TurboGeth uses ~4GB of RAM during genesis sync and < 1GB during normal work. The OS page cache can utilize an unlimited amount of memory.

Warning: Multiple instances of TG on the same machine will access the disk concurrently, which impacts performance, because one of the main TG optimisations is to reduce random disk access. The "Blocks Execution" stage still does many random reads, which is why it is the slowest stage. We do not recommend running multiple genesis syncs on the same disk. Once genesis sync has completed, it is fine to run multiple TG instances on the same disk.