erigon-pulse/ethdb/AbstractKV.md
Alex Sharov c5ffc971c5
[WIP] Badger v2 (#378)
* badger v2 investigation

* buf pool - use native New method and avoid double checks

* db.Open prototype

* db.Tx/Bucket/Cursor prototypes

* Chained config

* Item concept added

* save changes to test on master

* make hack resumable

* Design document v0

* Cursor concept

* less brackets syntax of cursor builder

* benchmarks

* cleanup fs

* test for context cancelations

* test for context cancelations

* test for cursor.Prefix option

* add ForEachKey method

* add ForEachKey method

* add naming explanation

* experiment of non-pointers cursor/bucket

* .Bucket() and .Cursor() doesn't returns error

* .Bucket() and .Cursor() doesn't returns error

* .Bucket() and .Cursor() doesn't returns error

* remove CursorOpts concept

* more test-cases

* simplify open api

* Tx, Bucket, Cursor - now are interfaces

* Tx, Bucket, Cursor - now are interfaces

* switch to interfaces

* rebase master

Co-authored-by: alex.sharov <alex.sharov@lazada.com>
2020-03-11 11:02:37 +00:00

4.3 KiB
Raw Blame History

Vision:

Ethereum gives users a powerful resource (which is hard to give) which is not explicitely priced - transaction atomicity and "serialisable" isolation (the highest level of isolation you can get in the databases). Which means that transaction does not even need to declare in advance what it wants to lock, the entire state is deemed "locked" for its execution. I wonder if the weaker isolation models would make sense. For example, "read committed"

with the weaker isolation, you might be able to split any transaction into smaller parts, each of which does not perform any Dynamic State Access (I have no proof of that though).

It is similar to Tendermint strategy, but even more granular. You can view it as a support for "continuations". Transactions starts, and whenever it hits dynamic access, its execution stops, gas is charged, and the continuation is added to the state. Then, transaction can be resumed, because by committing to some continuation, it makes its next dynamic state access static.

Design principles:

  • No internal copies/allocations - all must be delegated to user. Make it part of contract - written clearly in docs, because it's unsafe (unsafe to put slice to DB and then change it). Known problems: mutation.Put does copy internally.
  • Low-level API: as close to original Bolt/Badger as possible.
  • Expose concept of transaction - app-level code can .Rollback() or .Commit() at once.

Abstraction to support:

Buckets concept:

  • Bucket is an interface, cant be nil, can't return error
  • For Badger - auto-remove bucket from key prefix

InMemory and ReadOnly modes:

  • db.InMemDb() or db.Opts().InMem(true)

Context:

  • For transactions - yes
  • For .First() and .Next() methods - no

Cursor/Iterator:

  • Cursor is an interface, cant be nil, can't return error
  • cursor.Prefix(prefix) filtering keys by given prefix. Badger using i.Prefix. RemoteDb - to support server side filtering.
  • cursor.PrefetchSize(prefix) - useful for Badger and Remote
  • Badger iterator require i.Close() call - abstraction automated it.
  • Badger iterator has AllVersions=true by default - why?

Concept of Item:

  • Badger's concept of Item adding complexity, need hide it: k,v,err := curor.First()
  • No Lazy values, but can disable fetching values by: .Cursor().PrefetchValues(false).FirstKey()
  • Badger's metadata, ttl and version - dont expose

Managed/un-managed transactions

  • Tx is an interface
  • db.Update, db.View - yes
  • db.Batch - no

Errors:

  • Lib-Errors must be properly wrapped to project: for example ethdb.ErrKeyNotFound

Badgers streaming:

  • Need more research: why its based on callback instead of “for channel”? Is it ordered? Is it stoppable?
  • Is it equal to Bolts ForEach?

Yeld: abstraction leak from RemoteDb, but need investigate how Badger Streams working here

i.SeekTo vs i.Rewind: TBD

in-memory LRU cache: TBD

Copy of data:

  • Bolt does copy keys inside .Put(), but doesn't copy values. How behave Badger here?

Not covered by Abstractions:

  • DB stats, bucket.Stats(), item.EstimatedSize()
  • buckets stats, buckets list
  • Merge operator of Badger
  • TTL of keys
  • Reverse Iterator
  • Fetch AllVersions of Badger
  • Monotonic int DB.GetSequence
  • Nested Buckets
  • Backups, tx.WriteTo

Result interface:

type DB interface {
	View(ctx context.Context, f func(tx Tx) error) (err error)
	Update(ctx context.Context, f func(tx Tx) error) (err error)
	Close() error
}

type Tx interface {
	Bucket(name []byte) Bucket
}

type Bucket interface {
	Get(key []byte) (val []byte, err error)
	Put(key []byte, value []byte) error
	Delete(key []byte) error
	Cursor() Cursor
}

type Cursor interface {
	Prefix(v []byte) Cursor
	From(v []byte) Cursor
	MatchBits(uint) Cursor
	Prefetch(v uint) Cursor
	NoValues() Cursor

	First() ([]byte, []byte, error)
	Seek(seek []byte) ([]byte, []byte, error)
	Next() ([]byte, []byte, error)
	FirstKey() ([]byte, uint64, error)
	SeekKey(seek []byte) ([]byte, uint64, error)
	NextKey() ([]byte, uint64, error)

	Walk(walker func(k, v []byte) (bool, error)) error
	WalkKeys(walker func(k []byte, vSize uint64) (bool, error)) error
}

Naming:

  • Iter shorter Cursor shorter Iterator
  • Opts shorter Options
  • Walk shorter ForEach