Go to file
Alex Sharov e1ad20460c
Less obj db in snapshot (#2142)
* less obj db in snapshot

* less obj db in snapshot

* save
2021-06-11 09:35:26 +01:00
.buildkite Rename to Erigon (#2018) 2021-05-26 11:35:39 +01:00
.github Update ci.yml 2021-06-05 11:05:02 +01:00
accounts/abi TEVM status pruning (#2034) 2021-06-11 09:34:25 +01:00
cmd Less obj db in snapshot (#2142) 2021-06-11 09:35:26 +01:00
common TEVM status pruning (#2034) 2021-06-11 09:34:25 +01:00
consensus Do not remove headers when unwinding (#2122) 2021-06-08 16:16:27 +01:00
core TEVM status pruning (#2034) 2021-06-11 09:34:25 +01:00
crypto up version of uint256 lib (#2082) 2021-06-04 17:25:28 +01:00
design auto-format code by prettier (similar to gofmt) (#405) 2020-03-25 12:45:21 +07:00
docs Add files via upload 2021-06-08 16:38:42 +01:00
eth tevm_open_cursor_after_commit (#2141) 2021-06-11 09:35:00 +01:00
ethdb txpool_content (#2127) 2021-06-11 09:34:47 +01:00
ethstats Rename to Erigon (#2018) 2021-05-26 11:35:39 +01:00
event rename (#1978) 2021-05-20 19:25:53 +01:00
gointerfaces txpool_content (#2127) 2021-06-11 09:34:47 +01:00
interfaces txpool_content (#2127) 2021-06-11 09:34:47 +01:00
internal up version of uint256 lib (#2082) 2021-06-04 17:25:28 +01:00
log Rename to Erigon (#2018) 2021-05-26 11:35:39 +01:00
metrics return mem metrics (#2117) 2021-06-07 21:18:40 +07:00
migrations Simulated backend and genesis to kv (#2104) 2021-06-05 16:17:04 +01:00
node extract p2p config out of sentry (#2126) 2021-06-11 09:34:37 +01:00
p2p Add db label (later will add txpool db) (#2081) 2021-06-04 15:56:49 +01:00
params extract p2p config out of sentry (#2126) 2021-06-11 09:34:37 +01:00
rlp up version of uint256 lib (#2082) 2021-06-04 17:25:28 +01:00
rpc Rpcdaemon batch (#2105) 2021-06-05 15:10:52 +01:00
src Rpcdaemon: move tx pool to own grpc service, subscribe to pending txs (#1863) 2021-05-04 08:37:17 +07:00
tests txpool_content (#2127) 2021-06-11 09:34:47 +01:00
turbo txpool_content (#2127) 2021-06-11 09:34:47 +01:00
visual Continue comparison of genesis block with geth, expand long values (#223) 2019-12-06 12:03:12 +00:00
.dockerignore remove debug-web-ui and restapi (#1480) 2021-02-09 19:42:14 +07:00
.gitattributes .gitattributes: enable solidity highlighting on github (#16425) 2018-04-03 15:21:24 +02:00
.gitignore Win build patch (#2027) 2021-05-27 11:34:44 +01:00
.gitmodules Remove z3 and semantics (#1314) 2020-10-28 17:52:00 +00:00
.golangci.yml Snapshot sync headers stage (#1836) 2021-05-08 09:45:40 +01:00
.readthedocs.yml first sphinx doc portion (#1144) 2020-09-27 20:40:48 +01:00
AUTHORS build: deduplicate same authors with different casing 2019-07-22 12:31:11 +03:00
COPYING COYPING: restore the full text text of GPL (#21568) 2020-10-06 14:12:09 +02:00
COPYING.LESSER all: update license information 2015-07-07 14:12:44 +02:00
docker-compose.yml Rename to Erigon (#2018) 2021-05-26 11:35:39 +01:00
Dockerfile Add repo tag to build info (#2108) 2021-06-06 09:05:30 +01:00
go.mod txpool_content (#2127) 2021-06-11 09:34:47 +01:00
go.sum txpool_content (#2127) 2021-06-11 09:34:47 +01:00
interfaces.go rename (#1978) 2021-05-20 19:25:53 +01:00
Makefile txpool_content (#2127) 2021-06-11 09:34:47 +01:00
nightly.sh Nightly tests (#444) 2020-04-12 19:37:15 +01:00
oss-fuzz.sh Rename to Erigon (#2018) 2021-05-26 11:35:39 +01:00
README.md Update README.md 2021-06-06 20:27:01 +07:00
RELEASE_INSTRUCTIONS.md Rename to Erigon (#2018) 2021-05-26 11:35:39 +01:00
TESTING.md Rename to Erigon (#2018) 2021-05-26 11:35:39 +01:00
win-build.ps1 extract p2p config out of sentry (#2126) 2021-06-11 09:34:37 +01:00

Erigon

Erigon is an implementation of Ethereum (aka "Ethereum client"), on the efficiency frontier, written in Go.

Build status

NB! In-depth links are marked by the microscope sign (🔬)

Disclaimer: this software is currenly a tech preview. We will do our best to keep it stable and make no breaking changes but we don't guarantee anything. Things can and will break.

The current version is currently based on Go-Ethereum 1.10.1

System Requirements

Recommend 2Tb storage space on a single partition: 1Tb state, 200GB temp files (can symlink or mount folder <datadir>/etl-tmp to another disk).

RAM: 16GB, 64-bit architecture, Golang version >= 1.16

🔬 more info on disk storage is here)

Usage

Getting Started

> git clone --recurse-submodules -j8 https://github.com/ledgerwatch/erigon.git
> cd erigon
> make erigon
> ./build/bin/erigon

Testnets

If you would like to give Erigon a try, but do not have spare 2Tb on your driver, a good option is to start syncing one of the public testnets, Görli. It syncs much quicker, and does not take so much disk space:

> git clone --recurse-submodules -j8 https://github.com/ledgerwatch/erigon.git
> cd erigon
> make erigon
> ./build/bin/erigon --datadir goerli --chain goerli

Please note the --datadir option that allows you to store Erigon files in a non-default location, in this example, in goerli subdirectory of the current directory. Name of the directory --datadir does not have to match the name if the chain in --chain.

Mining

Support only remote-miners.

  • To enable, add --mine --miner.etherbase=... or --mine --miner.miner.sigkey=... flags.
  • Other supported options: --miner.extradata, --miner.notify, --miner.gaslimit, --miner.gasprice , --miner.gastarget
  • RPCDaemon supports methods: eth_coinbase , eth_hashrate, eth_mining, eth_getWork, eth_submitWork, eth_submitHashrate
  • RPCDaemon supports websocket methods: newPendingTransaction
  • TODO:
    • we don't broadcast mined blocks to p2p-network yet, but it's easy to accomplish
    • eth_newPendingTransactionFilter
    • eth_newBlockFilter
    • eth_newFilter
    • websocket Logs

🔬 Detailed mining explanation is here.

Windows

Windows users may run erigon in 3 possible ways:

  • Build executable binaries natively for Windows using provided win-build.ps1 PowerShell script which has to be run with local Administrator privileges. The script creates libmdbx.dll (MDBX is current default database for Erigon) and copies it into Windows's system32 folder (generally C:\Windows\system32). There are some requirements for a successful native build on windows :

    • Git for Windows must be installed. If you're cloning this repository is very likely you already have it
    • GO Programming Language must be installed. Minimum required version is 1.16
    • Chocolatey package manager for Windows must be installed. By Chocolatey you need to install the following components : cmake, make, mingw by choco install cmake make mingw.

    Important note about Anti-Viruses During MinGW's compiler detection phase some temporary executables are generated to test compiler capabilities. It's been reported some anti-virus programs detect those files as possibly infected by Win64/Kryptic.CIS trojan horse (or a variant of it). Although those are false positives we have no control over 100+ vendors of security products for Windows and their respective detection algorythms and we understand this might make your experience with Windows builds uncomfortable. To workaround the issue you might either set exlusions for your antivirus specifically for ethdb\mdbx\dist\CMakeFiles folder or you can run erigon on Docker or WSL

    Though is still possible to run erigon with LMDB database there's a caveat which might cause your experience with LMDB on Windows uncomfortable: data file allocation is fixed so you need to know in advance how much space you want to allocate for database file using the command line option --lmdb.mapSize. Please be advised Erigon will completely remove LMDB support in future releases thus we warmly suggest to resync using the default MDBX database.

  • Use Docker : see docker-compose.yml

  • Use WSL (Windows Subsystem for Linux) strictly on version 2. Under this option you can build Erigon just as you would on a regular Linux distribution. You can point your data also to any of the mounted Windows partitions (eg. /mnt/c/[...], /mnt/d/[...] etc) but in such case be advised performance is impacted: this is due to the fact those mount points use DrvFS which is a network file system and, additionally, MDBX locks the db for exclusive access which implies only one process at a time can access data. This has consequences on the running of rpcdaemon which has to be configured as Remote DB even if it is executed on the very same computer. If instead your data is hosted on the native Linux filesystem non limitations apply. Please also note the default WSL2 environment has its own IP address which does not match the one of the network interface of Windows host: take this into account when configuring NAT for port 30303 on your router.

Key features

🔬 See more detailed overview of functionality and current limitations. It is being updated on recurring basis.

More Efficient State Storage

Flat KV storage. Erigon uses a key-value database and storing accounts and storage in a simple way.

🔬 See our detailed DB walkthrough here.

Preprocessing. For some operations, Erigon uses temporary files to preprocess data before inserting it into the main DB. That reduces write amplification and DB inserts are orders of magnitude quicker.

🔬 See our detailed ETL explanation here.

Plain state.

Single accounts/state trie. Erigon uses a single Merkle trie for both accounts and the storage.

Faster Initial Sync

Erigon uses a rearchitected full sync algorithm from Go-Ethereum that is split into "stages".

🔬 See more detailed explanation in the Staged Sync Readme

It uses the same network primitives and is compatible with regular go-ethereum nodes that are using full sync, you do not need any special sync capabilities for Erigon to sync.

When reimagining the full sync, we focused on batching data together and minimize DB overwrites. That makes it possible to sync Ethereum mainnet in under 2 days if you have a fast enough network connection and an SSD drive.

Examples of stages are:

  • Downloading headers;

  • Downloading block bodies;

  • Recovering senders' addresses;

  • Executing blocks;

  • Validating root hashes and building intermediate hashes for the state Merkle trie;

  • [...]

JSON-RPC daemon

In Erigon RPC calls are extracted out of the main binary into a separate daemon. This daemon can use both local or remote DBs. That means, that this RPC daemon doesn't have to be running on the same machine as the main Erigon binary or it can run from a snapshot of a database for read-only calls.

🔬 See RPC-Daemon docs

For local DB

This is only possible if RPC daemon runs on the same computer as Erigon. This mode of operation uses shared memory access to the database of Erigon, which is reported to have better performance than accessing via TPC socket (see "For remote DB" section below)

> make rpcdaemon
> ./build/bin/rpcdaemon --datadir ~/Library/Erigon/ --http.api=eth,debug,net

In this mode, some RPC API methods do not work. Please see "For dual mode" section below on how to fix that.

For remote DB

This works regardless of whether RPC daemon is on the same computer with Erigon, or on a different one. They use TPC socket connection to pass data between them. To use this mode, run Erigon in one terminal window

> ./build/bin/erigon --private.api.addr=localhost:9090

Run RPC daemon

> ./build/bin/rpcdaemon --private.api.addr=localhost:9090 --http.api=eth,debug,net

gRPC ports: 9090 erigon, 9091 sentry, 9092 consensus engine, 9093 snapshot downloader, 9094 TxPool

For dual mode

If both --datadir and --private.api.addr options are used for RPC daemon, it works in a "dual" mode. This only works when RPC daemon is on the same computer as Erigon. In this mode, most data transfer from Erigon to RPC daemon happens via shared memory, only certain things (like new header notifications) happen via TPC socket.

Supported JSON-RPC calls (eth, debug, net, web3):

For a details on the implementation status of each command, see this table.

Run all components by docker-compose

Next command starts: Erigon on port 30303, rpcdaemon 8545, prometheus 9090, grafana 3000

docker-compose build
XDG_DATA_HOME=/preferred/data/folder docker-compose up

Grafana dashboard

docker-compose up prometheus grafana, detailed docs.

FAQ

How much RAM do I need

  • Baseline (ext4 SSD): 16Gb RAM sync takes 5 days, 32Gb - 4 days, 64Gb - 3 days
  • +1 day on "zfs compression=off". +2 days on "zfs compression=on" (2x compression ratio). +3 days on btrfs.
  • -1 day on NVMe

Detailed explanation: ./docs/programmers_guide/db_faq.md

Getting in touch

Erigon Discord Server

The main discussions are happening on our Discord server. To get an invite, send an email to tg [at] torquem.ch with your name, occupation, a brief explanation of why you want to join the Discord, and how you heard about Erigon.

Reporting security issues/concerns

Send an email to security [at] torquem.ch.

Team

Core contributors (in alpabetical order of first names):

Thanks to:

  • All contributors of Erigon

  • All contributors of Go-Ethereum

  • Our special respect and graditude is to the core team of Go-Ethereum. Keep up the great job!

Happy testing! 🥤

Known issues

htop shows incorrect memory usage when using LMDB

Erigon's internal DB (LMDB) using MemoryMap - when OS does manage all read, write, cache operations instead of Application (linux, windows)

htop on column res shows memory of "App + OS used to hold page cache for given App", but it's not informative, because if htop says that app using 90% of memory you still can run 3 more instances of app on the same machine - because most of that 90% is "OS pages cache".
OS automatically free this cache any time it needs memory. Smaller "page cache size" may not impact performance of Erigon at all.

Next tools show correct memory usage of Erigon:

  • vmmap -summary PID | grep -i "Physical footprint". Without grep you can see details - section MALLOC ZONE column Resident Size shows App memory usage, section REGION TYPE column Resident Size shows OS pages cache size.
  • Prometheus dashboard shows memory of Go app without OS pages cache (make prometheus, open in browser localhost:3000, credentials admin/admin)
  • cat /proc/<PID>/smaps

Erigon uses ~4Gb of RAM during genesis sync and ~1Gb during normal work. OS pages cache can utilize unlimited amount of memory.

Warning: Multiple instances of Erigon on same machine will touch Disk concurrently, it impacts performance - one of main Erigon optimisations: "reduce Disk random access". "Blocks Execution stage" still does much random reads - this is reason why it's slowest stage. We do not recommend run multiple genesis syncs on same Disk. If genesis sync passed, then it's fine to run multiple Erigon on same Disk.

Blocks Execution is slow on cloud-network-drives

Please read https://github.com/ledgerwatch/erigon/issues/1516#issuecomment-811958891 In short: network-disks are bad for blocks execution - because blocks execution reading data from db non-parallel non-batched way.

rpcdaemon "Dual-Mode" does not work with Docker Container

Running rpcdaemon in "Dual-Mode" (including the --datadir flag) generally results in better performance for RPC calls, however, this does not work when running erigon and rpcdaemon in separate containers. For the absolute best performance bare metal is recommended at this time.