erigon-pulse

mirror of https://gitlab.com/pulsechaincom/erigon-pulse.git synced 2024-12-22 19:50:36 +00:00

Author	SHA1	Message	Date
milen	230b013096	metrics: separate usage of prometheus counter and gauge interfaces (#8793 )	2023-11-24 16:15:12 +01:00
Mark Holt	509a7af26a	Discovery zero refresh timer (#8661 ) This fixes an issue where the mumbai testnet node struggle to find peers. Before this fix in general test peer numbers are typically around 20 in total between eth66, eth67 and eth68. For new peers some can struggle to find even a single peer after days of operation. These are the numbers after 12 hours or running on a node which previously could not find any peers: eth66=13, eth67=76, eth68=91. The root cause of this issue is the following: - A significant number of mumbai peers around the boot node return network ids which are different from those currently available in the DHT - The available nodes are all consequently busy and return 'too many peers' for long periods These issues case a significant number of discovery timeouts, some of the queries will never receive a response. This causes the discovery read loop to enter a channel deadlock - which means that no responses are processed, nor timeouts fired. This causes the discovery process in the node to stop. From then on it just re-requests handshakes from a relatively small number of peers. This check in fixes this situation with the following changes: - Remove the deadlock by running the timer in a separate go-routine so it can run independently of the main request processing. - Allow the discovery process matcher to match on port if no id match can be established on initial ping. This allows subsequent node validation to proceed and if the node proves to be valid via the remainder of the look-up and handshake process it us used as a valid peer. - Completely unsolicited responses, i.e. those which come from a completely unknown ip:port combination continue to be ignored. -	2023-11-07 08:48:58 +00:00
Dmytro	9adf31b8eb	bytes transfet separated by capability and category (#8568 ) Co-authored-by: Mark Holt <mark@distributed.vision>	2023-10-27 22:30:28 +03:00
Dmytro	ec59be2261	Dvovk/sentinel and sentry peers data collect (#8533 )	2023-10-23 17:33:08 +03:00
a	436493350e	Sentinel refactor (#8296 ) 1. changes sentinel to use an http-like interface 2. moves hexutil, crypto/blake2b, metrics packages to erigon-lib	2023-10-22 01:17:18 +02:00
battlmonstr	d6df923dd8	p2p: limit ping requests from a single peer (#8113 ) see: https://github.com/ethereum/go-ethereum/pull/27887	2023-09-06 17:56:03 +02:00
Mark Holt	8ea0096d56	moved metrics sub packages types to metrics (#8119 ) This is a non functional change which consolidates the various packages under metrics into the top level package now that the dead code is removed. It is a precursor to the removal of Victoria metrics after which all erigon metrics code will be contained in this single package.	2023-09-03 08:09:27 +07:00
battlmonstr	340b9811b0	p2p: refactor peer errors to propagate with a DiscReason (#8089 ) Improve p2p error handling to propagate errors from the origin up the call chain the Server peer removal code using a new PeerError type containing a DiscReason and a more detailed description. The origin can be tracked down using PeerErrorCode (code) and DiscReason (reason) which looks like this in the log: > [TRACE] [08-28\|16:33:40.205] Removing p2p peer peercount=0 url=enode://d399f4b...@1.2.3.4:30303 duration=6.901ms err="PeerError(code=remote disconnect reason, reason=too many peers, err=<nil>, message=Peer.run got a remote DiscReason)"	2023-08-31 16:45:23 +01:00
Mark Holt	a4cfbe0d56	Heimdall metrics + Metrics HTTP server rationalization (#8094 ) This is an update of: https://github.com/ledgerwatch/erigon/pull/7846 which uses a local fork of victoria metrics to include the changes that https://github.com/anshalshukla added to the original for we where using. It also includes code to address the duplicate metrics issue identified here: https://github.com/ledgerwatch/erigon/issues/8053 It has one more associated fix which is to correctly add a metadata label to counters, these where previously labelled as gauges. e.g. ``` # TYPE p2p_peers counter p2p_peers 0 ``` rather than ``` # TYPE p2p_peers gauge p2p_peers 0 ``` --------- Co-authored-by: Anshal Shukla <53994948+anshalshukla@users.noreply.github.com> Co-authored-by: Anshal Shukla <shukla.anshal85@gmail.com>	2023-08-31 09:04:27 +01:00
battlmonstr	bb2c2adbb6	p2p: fix RLPx disconnect message decoding (#8056 ) The disconnect message could either be a plain integer, or a list with one integer element. We were encoding it as a plain integer, but decoding as a list. Change this to be able to decode any format.	2023-08-24 13:49:19 +02:00
hexoscott	efd541028c	read metrics config from yaml file (#7089 )	2023-03-14 00:07:05 +00:00
Alex Sharov	8afeee56c8	Downloader extract, step2 (#6076 )	2022-11-20 10:41:30 +07:00
Håvard Anda Estensen	7c15ed59e4	Enable prealloc linter (#5177 ) * Enable prealloc linter * Set inital slice len to 0	2022-08-26 10:04:36 +07:00
ledgerwatch	64684034d6	p2p: define DiscReason as uint8 (#4090 ) Co-authored-by: Alexey Sharp <alexeysharp@Alexeys-iMac.local>	2022-05-06 16:19:53 +01:00
ledgerwatch	f56d4c5881	Switch peerId from 256 to 512 bit (as in stable) (#3862 ) * Switch peerId from 256 to 512 bit (as in stable) * go mod tidy * Fix some tests * Fixed * Fixes * Fix tests * Update to erigon-lib main Co-authored-by: Alex Sharp <alexsharp@Alexs-MacBook-Pro.local> Co-authored-by: Alexey Sharp <alexeysharp@Alexeys-iMac.local>	2022-04-10 08:01:25 +01:00
battlmonstr	4337871f7f	Rename log/logger to follow conventions. (#3579 ) * use "log" for struct fields * use "logger" for function parameters and local vars This is a compromise between: 1) using logger := log.New() to avoid aliasing (log := log.New()) 2) and keeping it short when logging e.g.: srv.log.Info(...)	2022-02-22 18:17:15 +00:00
TBC Dev	e1c44cd19b	Change sentry peer_id from H512 pubkey to H256 keccak256(pubkey) (#3013 ) * Rename protoHandshake.ID to protoHandshake.Pubkey * Fix enode.ID comment descriptions * Change sentry peer_id from H512 pubkey to H256 keccak256(pubkey) * Simplify PeerInfo helpers	2021-11-22 05:39:31 +00:00
Alex Sharov	286572e99f	Remove some unused code related to metrics (#2469 )	2021-07-29 22:37:48 +07:00
Alex Sharov	5069558752	Apache licensed logger (#2460 )	2021-07-29 17:23:23 +07:00
alex.sharov	bf06a2ce4b	better peers logging	2021-07-03 10:35:11 +07:00
Alex Sharov	59d05dc5fe	hide file exists err (#2218 )	2021-06-22 11:09:45 +01:00
BitBaseBit	7ed337cdcc	Implemented panic handling, graceful shutdown and reporting for all goroutines that don't explicitly handle them. (#2153 ) * implemented crash reporting for all goroutine panics that aren't handled explicitly * implemented crash reporting for all goroutine panics that aren't handled explicitly * changed node defaults back to originals after testing * implemented panic handling for all goroutines that don't explicitly handle them, outputting the stack trace to a file in crashreports * handling panics on all goroutines gracefully * updated missing call * error assignment * implemented suggestions * path.Join added * implemented Evgeny's suggestions * changed path.Join to filepath.Join for cross-platform * added err check * updated RecoverStackTrace to LogPanic * updated closures * removed call of common.Go to some goroutines * updated scope capture * removed testing files * reverted back to original method, I feel like its less intrusive * update filename for clarity	2021-06-13 17:41:39 +01:00
Alex Sharov	0be3044b7e	rename (#1978 ) * rename * rename "make grpc" * rename "abi bindings templates" * rename "abi bindings templates"	2021-05-20 19:25:53 +01:00
Artem Vorotnikov	9b8cdc0f22	Fix lints and remove more unused code (#1621 )	2021-03-29 10:58:45 +07:00
Martin Holst Swende	aaeb4a40a3	eth: don't wait for snap registration if we're not running snap (#22272 ) Prevents a situation where we (not running snap) connects with a peer running snap, and get stalled waiting for snap registration to succeed (which will never happen), which cause a waitgroup wait to halt shutdown	2021-03-10 10:26:58 +01:00
Péter Szilágyi	08ad6aaec7	eth: check snap satelliteness, delegate drop to eth (#22235 ) * eth: check snap satelliteness, delegate drop to eth * eth: better handle eth/snap satellite relation, merge reg/unreg paths # Conflicts: # eth/handler.go # eth/peer.go	2021-03-09 13:55:09 +01:00
Martin Holst Swende	ff2b58887f	eth, p2p: reserve half peer slots for snap peers during snap sync (#22171 ) * eth, p2p: reserve half peer slots for snap peers during snap sync * eth: less logging * eth: rework the eth/snap peer reservation logic * eth: rework the eth/snap peer reservation logic (again)	2021-03-09 12:46:30 +01:00
Martin Holst Swende	926cc78870	eth, p2p: use truncated names (#21698 ) * peer: return localAddr instead of name to prevent spam We currently use the name (which can be freely set by the peer) in several log messages. This enables malicious actors to write spam into your geth log. This commit returns the localAddr instead of the freely settable name. * p2p: reduce usage of peer.Name in warn messages * eth, p2p: use truncated names * Update peer.go Co-authored-by: Marius van der Wijden <m.vanderwijden@live.de> Co-authored-by: Felix Lange <fjl@twurst.com>	2020-10-26 17:16:00 +01:00
Péter Szilágyi	d73075bd3d	p2p: measure packet throughput too, not just bandwidth	2020-08-07 11:21:39 +02:00
ucwong	6f75f79ebe	p2p: defer wait group done in protocol start (#20951 )	2020-05-20 15:26:22 +03:00
Felix Lange	e494fbd436	p2p: remove MeteredPeerEvent (#20679 ) This event was added for the dashboard, but we don't need it anymore since the dashboard is gone.	2020-02-27 17:21:20 +03:00
Alexey Akhunov	fe01bccbb8	Apply Turbo-Geth modifications to go-ethereum codebase	2019-11-01 21:52:03 +01:00
Péter Szilágyi	a2a60869c8	p2p: measure subprotocol bandwidth usage	2019-09-27 18:00:25 +03:00
Kurkó Mihály	a1f8549262	p2p: add ENR to PeerInfo (#19816 )	2019-07-19 11:25:43 +02:00
Martin Holst Swende	7fd82a0e3e	p2p: add address info to peer event reporting (#19716 )	2019-07-05 20:27:13 +02:00
Felix Lange	c420dcb39c	p2p: enforce connection retry limit on server side (#19684 ) The dialer limits itself to one attempt every 30s. Apply the same limit in Server and reject peers which try to connect too eagerly. The check against the limit happens right after accepting the connection. Further changes in this commit ensure we pass the Server logger down to Peer instances, discovery and dialState. Unit test logging now works in all Server tests.	2019-06-11 12:45:33 +02:00
Felix Lange	1895059119	p2p: add enode URL to PeerInfo (#17838 )	2018-10-04 18:13:21 +03:00
Felix Lange	30cd5c1854	all: new p2p node representation (#17643 ) Package p2p/enode provides a generalized representation of p2p nodes which can contain arbitrary information in key/value pairs. It is also the new home for the node database. The "v4" identity scheme is also moved here from p2p/enr to remove the dependency on Ethereum crypto from that package. Record signature handling is changed significantly. The identity scheme registry is removed and acceptable schemes must be passed to any method that needs identity. This means records must now be validated explicitly after decoding. The enode API is designed to make signature handling easy and safe: most APIs around the codebase work with enode.Node, which is a wrapper around a valid record. Going from enr.Record to enode.Node requires a valid signature. * p2p/discover: port to p2p/enode This ports the discovery code to the new node representation in p2p/enode. The wire protocol is unchanged, this can be considered a refactoring change. The Kademlia table can now deal with nodes using an arbitrary identity scheme. This requires a few incompatible API changes: - Table.Lookup is not available anymore. It used to take a public key as argument because v4 protocol requires one. Its replacement is LookupRandom. - Table.Resolve takes enode.Node instead of NodeID. This is also for v4 protocol compatibility because nodes cannot be looked up by ID alone. - Types Node and NodeID are gone. Further commits in the series will be fixes all over the the codebase to deal with those removals. p2p: port to p2p/enode and discovery changes This adapts package p2p to the changes in p2p/discover. All uses of discover.Node and discover.NodeID are replaced by their equivalents from p2p/enode. New API is added to retrieve the enode.Node instance of a peer. The behavior of Server.Self with discovery disabled is improved. It now tries much harder to report a working IP address, falling back to 127.0.0.1 if no suitable address can be determined through other means. These changes were needed for tests of other packages later in the series. * p2p/simulations, p2p/testing: port to p2p/enode No surprises here, mostly replacements of discover.Node, discover.NodeID with their new equivalents. The 'interesting' API changes are: - testing.ProtocolSession tracks complete nodes, not just their IDs. - adapters.NodeConfig has a new method to create a complete node. These changes were needed to make swarm tests work. Note that the NodeID change makes the code incompatible with old simulation snapshots. * whisper/whisperv5, whisper/whisperv6: port to p2p/enode This port was easy because whisper uses []byte for node IDs and URL strings in the API. * eth: port to p2p/enode Again, easy to port because eth uses strings for node IDs and doesn't care about node information in any way. * les: port to p2p/enode Apart from replacing discover.NodeID with enode.ID, most changes are in the server pool code. It now deals with complete nodes instead of (Pubkey, IP, Port) triples. The database format is unchanged for now, but we should probably change it to use the node database later. * node: port to p2p/enode This change simply replaces discover.Node and discover.NodeID with their new equivalents. * swarm/network: port to p2p/enode Swarm has its own node address representation, BzzAddr, containing both an overlay address (the hash of a secp256k1 public key) and an underlay address (enode:// URL). There are no changes to the BzzAddr format in this commit, but certain operations such as creating a BzzAddr from a node ID are now impossible because node IDs aren't public keys anymore. Most swarm-related changes in the series remove uses of NewAddrFromNodeID, replacing it with NewAddr which takes a complete node as argument. ToOverlayAddr is removed because we can just use the node ID directly.	2018-09-25 00:59:00 +02:00
Felföldi Zsolt	c4df67461f	Merge pull request #16333 from shazow/addremovetrustedpeer rpc: Add admin_addTrustedPeer and admin_removeTrustedPeer.	2018-08-06 13:30:04 +02:00
jkcomment	65c91ad5e7	p2p: correct comments typo (#17184 )	2018-07-18 10:41:18 +03:00
ethersphere	e187711c65	swarm: network rewrite merge	2018-06-21 21:10:31 +02:00
Andrey Petrov	6209545083	p2p: Wrap conn.flags ops with atomic.Load/Store	2018-06-21 12:22:47 -04:00
Andrey Petrov	dcca66bce8	p2p: Cache inbound flag on Peer.isInbound to avoid a race	2018-06-21 12:22:47 -04:00
Guilherme Salgado	c60f6f6214	p2p: don't discard reason set by Disconnect (#16559 ) Peer.run was discarding the reason for disconnection sent to the disc channel by Disconnect.	2018-05-09 01:20:20 +02:00
thomasmodeneis	ba1030b6b8	build: enable goimports and varcheck linters (#16446 )	2018-04-18 00:53:50 +02:00
Felix Lange	9123eceb0f	p2p, p2p/discover: misc connectivity improvements (#16069 ) * p2p: add DialRatio for configuration of inbound vs. dialed connections * p2p: add connection flags to PeerInfo * p2p/netutil: add SameNet, DistinctNetSet * p2p/discover: improve revalidation and seeding This changes node revalidation to be periodic instead of on-demand. This should prevent issues where dead nodes get stuck in closer buckets because no other node will ever come along to replace them. Every 5 seconds (on average), the last node in a random bucket is checked and moved to the front of the bucket if it is still responding. If revalidation fails, the last node is replaced by an entry of the 'replacement list' containing recently-seen nodes. Most close buckets are removed because it's very unlikely we'll ever encounter a node that would fall into any of those buckets. Table seeding is also improved: we now require a few minutes of table membership before considering a node as a potential seed node. This should make it less likely to store short-lived nodes as potential seeds. * p2p/discover: fix nits in UDP transport We would skip sending neighbors replies if there were fewer than maxNeighbors results and CheckRelayIP returned an error for the last one. While here, also resolve a TODO about pong reply tokens.	2018-02-12 14:36:09 +02:00
Lewis Marshall	54aeb8e4c0	p2p/simulations: various stability fixes (#15198 ) p2p/simulations: introduce dialBan - Refactor simulations/network connection getters to support avoiding simultaneous dials between two peers If two peers dial simultaneously, the connection will be dropped to help avoid that, we essentially lock the connection object with a timestamp which serves as a ban on dialing for a period of time (dialBanTimeout). - The connection getter InitConn can be wrapped and passed to the nodes via adapters.NodeConfig#Reachable field and then used by the respective services when they initiate connections. This massively stablise the emerging connectivity when running with hundreds of nodes bootstrapping a network. p2p: add Inbound public method to p2p.Peer p2p/simulations: Add server id to logs to support debugging in-memory network simulations when multiple peers are logging. p2p: SetupConn now returns error. The dialer checks the error and only calls resolve if the actual TCP dial fails.	2017-12-01 12:49:04 +01:00
Péter Szilágyi	2ee885958b	p2p: snappy encoding for devp2p (version bump to 5) (#15106 ) * p2p: snappy encoding for devp2p (version bump to 5) * p2p: remove lazy decompression, enforce 16MB limit	2017-09-26 16:54:49 +03:00
Lewis Marshall	9feec51e2d	p2p: add network simulation framework (#14982 ) This commit introduces a network simulation framework which can be used to run simulated networks of devp2p nodes. The intention is to use this for testing protocols, performing benchmarks and visualising emergent network behaviour.	2017-09-25 10:08:07 +02:00
Martin Holst Swende	dc92779c0a	p2p: change ping ticker to timer (#15071 ) Using a Timer over Ticker seems to be a lot better, though I cannot fully account for why that it behaves so (since Ticker should be more bursty, but not necessarily more active over time, but that may depend on how long window it uses to decide on when to tick next)	2017-09-04 09:24:52 +02:00

1 2

99 Commits