netbird

mirror of https://github.com/netbirdio/netbird.git synced 2026-05-21 08:09:55 +00:00

Author	SHA1	Message	Date
Viktor Liu	7e5846a1ee	Resolve merge conflicts with main	2026-05-20 15:38:01 +02:00
Maycon Santos	8e2505b59c	[management] Add metrics for peer status updates and ephemeral cleanup (#6196 ) * [management] Add metrics for peer status updates and ephemeral cleanup The session-fenced MarkPeerConnected / MarkPeerDisconnected path and the ephemeral peer cleanup loop both run silently today: when fencing rejects a stale stream, when a cleanup tick deletes peers, or when a batch delete fails, we have no operational signal beyond log lines. Add OpenTelemetry counters and a histogram so the same SLO-style dashboards that already exist for the network-map controller can cover peer connect/disconnect and ephemeral cleanup too. All new attributes are bounded enums: operation in {connect,disconnect} and outcome in {applied,stale,error,peer_not_found}. No account, peer, or user ID is ever written as a metric label — total cardinality is fixed at compile time (8 counter series, 2 histogram series, 4 unlabeled ephemeral series). Metric methods are nil-receiver safe so test composition that doesn't wire telemetry (the bulk of the existing tests) works unchanged. The ephemeral manager exposes a SetMetrics setter rather than taking the collector through its constructor, keeping the constructor signature stable across all test call sites. * [management] Add OpenTelemetry metrics for ephemeral peer cleanup Introduce counters for tracking ephemeral peer cleanup, including peers pending deletion, cleanup runs, successful deletions, and failed batches. Metrics are nil-receiver safe to ensure compatibility with test setups without telemetry.	2026-05-20 12:11:15 +02:00
Maycon Santos	d3f3e08035	Avoid context cancellation in `cancelPeerRoutines` (#6175 ) When closing go routines and handling peer disconnect, we should avoid canceling the flow due to parent gRPC context cancellation. This change triggers disconnection handling with a context that is not bound to the parent gRPC cancellation.	2026-05-20 12:11:15 +02:00
Maycon Santos	d250f92c43	feat(reverse-proxy): clusters API surfaces type, online status, and capability flags (#6148 ) The cluster listing now answers three questions in one round-trip instead of forcing the dashboard to cross-reference the domains API: which clusters can this account see, are they currently up, and what do they support. The ProxyCluster wire type drops the boolean self_hosted in favour of a `type` enum (`account` / `shared`) plus explicit `online`, `supports_custom_ports`, `require_subdomain`, and `supports_crowdsec` fields. Store query reworked so offline clusters still appear (no last_seen WHERE), with online and connected_proxies both derived from the existing 2-min active window via portable CASE expressions; the 1-hour heartbeat reaper still removes long-stale rows. Service manager enriches each cluster with the capability flags via the existing per-cluster lookups (CapabilityProvider now also exposes ClusterSupportsCrowdSec). GetActiveClusterAddresses* keep their tight 2-min filter so service routing and domain enumeration aren't pulled into the wider window. The hard cut removes self_hosted from the response — the dashboard is the only consumer and is updated in the matching PR; no transitional field is shipped. Adds a cross-engine regression test asserting offline clusters surface, connected_proxies counts only fresh proxies, and account-scoped BYOP clusters never leak across accounts.	2026-05-20 10:08:34 +02:00
Maycon Santos	af24fd7796	[management] Add metrics for peer status updates and ephemeral cleanup (#6196 ) * [management] Add metrics for peer status updates and ephemeral cleanup The session-fenced MarkPeerConnected / MarkPeerDisconnected path and the ephemeral peer cleanup loop both run silently today: when fencing rejects a stale stream, when a cleanup tick deletes peers, or when a batch delete fails, we have no operational signal beyond log lines. Add OpenTelemetry counters and a histogram so the same SLO-style dashboards that already exist for the network-map controller can cover peer connect/disconnect and ephemeral cleanup too. All new attributes are bounded enums: operation in {connect,disconnect} and outcome in {applied,stale,error,peer_not_found}. No account, peer, or user ID is ever written as a metric label — total cardinality is fixed at compile time (8 counter series, 2 histogram series, 4 unlabeled ephemeral series). Metric methods are nil-receiver safe so test composition that doesn't wire telemetry (the bulk of the existing tests) works unchanged. The ephemeral manager exposes a SetMetrics setter rather than taking the collector through its constructor, keeping the constructor signature stable across all test call sites. * [management] Add OpenTelemetry metrics for ephemeral peer cleanup Introduce counters for tracking ephemeral peer cleanup, including peers pending deletion, cleanup runs, successful deletions, and failed batches. Metrics are nil-receiver safe to ensure compatibility with test setups without telemetry.	2026-05-18 22:55:19 +02:00
Viktor Liu	9f0aa1ce26	Add embedded VNC server with JWT auth and per-peer toggle	2026-05-16 16:49:14 +02:00
Maycon Santos	347c5bf317	Avoid context cancellation in `cancelPeerRoutines` (#6175 ) When closing go routines and handling peer disconnect, we should avoid canceling the flow due to parent gRPC context cancellation. This change triggers disconnection handling with a context that is not bound to the parent gRPC cancellation.	2026-05-16 16:29:01 +02:00
Vlad	e916f12cca	[proxy] auth token generation on mapping (#6157 ) * [management / proxy] auth token generation on mapping * fix tests	2026-05-15 19:13:44 +02:00
Viktor Liu	07e5450117	[management] Bracket IPv6 reverse-proxy target hosts when building URL Host field (#6141 )	2026-05-14 16:42:40 +02:00
Vlad	77b479286e	[management] fix offline statuses for public proxy clusters (#6133 )	2026-05-14 13:27:50 +02:00
Vlad	07cbfdbede	[proxy] feature: bring your own proxy (#5627 )	2026-05-11 14:31:38 +02:00
Nicolas Frati	e89aad09f5	[management] Enable MFA for local users (#5804 ) * wip: totp for local users * fix providers not getting populated * polished UI and fix post_login_redirect_uri * fix: make sure logout is only prompted from oidc flow Signed-off-by: jnfrati <nicofrati@gmail.com> * update templates Signed-off-by: jnfrati <nicofrati@gmail.com> * deps: update dex dependency Signed-off-by: jnfrati <nicofrati@gmail.com> * fix qube issues Signed-off-by: jnfrati <nicofrati@gmail.com> * replace window with globalThis on home html Signed-off-by: jnfrati <nicofrati@gmail.com> * fixed coderabbit comments Signed-off-by: jnfrati <nicofrati@gmail.com> * debug * remove unused config and rename totp issuer * deps: update dex reference to latest * add dashboard post logout redirect uri to embedded config * implemented api for mfa configuration * update docs and config parsing * catch error on idp manager init mfa * fix tests * Add remember me for MFA * Add cookie encryption and session share between tabs * fixed logout showing non actionable error and session cookie encription key * fixed missing mfa settings on sql query for account * fix code index for mfa activity --------- Signed-off-by: jnfrati <nicofrati@gmail.com> Co-authored-by: braginini <bangvalo@gmail.com>	2026-05-08 16:31:20 +02:00
Pascal Fischer	39eac377e4	[management] add update reason to buffered calls (#6103 )	2026-05-07 15:55:59 +02:00
Viktor Liu	205ebcfda2	[management, client] Add IPv6 overlay support (#5631 )	2026-05-07 11:33:37 +02:00
Pascal Fischer	cfb1b3fe31	[proxy] consolidate mapping update (#6072 )	2026-05-05 18:40:42 +02:00
Pascal Fischer	97db824929	[management] fix proxy reconnect (#6063 )	2026-05-04 20:43:25 +02:00
Pascal Fischer	f29f5a0978	[management] add monitoring for nmap update source (#6036 )	2026-04-30 14:52:54 +02:00
Bethuel Mmbaga	df197d5001	[management] Prevent JWT reuse during peer login (#6002 )	2026-04-29 15:04:27 +03:00
Vlad	154b81645a	[management] removed legacy network map code (#5565 )	2026-04-27 16:02:54 +02:00
Vlad	b6038e8acd	[management] refactor: changeable pat rate limiting (#5946 )	2026-04-23 15:13:22 +02:00
Maycon Santos	53b04e512a	[management] Reuse a single cache store across all management server consumers (#5889 ) * Add support for legacy IDP cache environment variable * Centralize cache store creation to reuse a single Redis connection pool Each cache consumer (IDP cache, token store, PKCE store, secrets manager, EDR validator) was independently calling NewStore, creating separate Redis clients with their own connection pools — up to 1400 potential connections from a single management server process. Introduce a shared CacheStore() singleton on BaseServer that creates one store at boot and injects it into all consumers. Consumer constructors now receive a store.StoreInterface instead of creating their own. For Redis mode, all consumers share one connection pool (1000 max conns). For in-memory mode, all consumers share one GoCache instance. * Update management-integrations module to latest version * sync go.sum * Export `GetAddrFromEnv` to allow reuse across packages * Update management-integrations module version in go.mod and go.sum * Update management-integrations module version in go.mod and go.sum	2026-04-16 16:04:53 +02:00
Viktor Liu	0a30b9b275	[management, proxy] Add CrowdSec IP reputation integration for reverse proxy (#5722 )	2026-04-14 12:14:58 +02:00
Pascal Fischer	cf86b9a528	[management] enable access log cleanup by default (#5842 )	2026-04-10 17:07:27 +02:00
Pascal Fischer	15709bc666	[management] update account delete with proper proxy domain and service cleanup (#5817 )	2026-04-10 13:08:04 +02:00
Pascal Fischer	14b3b77bda	[management] validate permissions on groups read with name (#5749 )	2026-04-07 14:13:09 +02:00
Bethuel Mmbaga	9d1a37c644	[management,client] Revert gRPC client secret removal (#5781 ) * This reverts commit `e5914e4e8b` Signed-off-by: bcmmbaga <bethuelmbaga12@gmail.com> * Deprecate client secret in proto Signed-off-by: bcmmbaga <bethuelmbaga12@gmail.com> * Fix lint Signed-off-by: bcmmbaga <bethuelmbaga12@gmail.com> --------- Signed-off-by: bcmmbaga <bethuelmbaga12@gmail.com>	2026-04-02 18:21:00 +02:00
Viktor Liu	5bf2372c4d	[management] Fix L4 service creation deadlock on single-connection databases (#5779 )	2026-04-02 14:46:14 +02:00
shuuri-labs	940f530ac2	[management] Legacy to embedded IdP migration tool (#5586 )	2026-04-01 13:53:19 +02:00
Bethuel Mmbaga	e5914e4e8b	[management,client] Remove client secret from gRPC auth flow (#5751 ) Remove client secret from gRPC auth flow. The secret was originally included to support providers like Google Workspace that don't offer a proper PKCE flow, but this is no longer necessary with the embedded IdP. Deployments using such providers should migrate to the embedded IdP instead.	2026-03-31 18:50:49 +03:00
Viktor Liu	0765352c99	[management] Persist proxy capabilities to database (#5720 )	2026-03-30 13:03:42 +02:00
Pascal Fischer	be6fd119d8	[management] no events for temporary peers (#5719 )	2026-03-30 10:08:02 +02:00
Pascal Fischer	7e1cce4b9f	[management] add terminated field to service (#5700 )	2026-03-26 16:59:08 +01:00
Viktor Liu	0fc63ea0ba	[management] Allow multiple header auths with same header name (#5678 )	2026-03-24 16:18:21 +01:00
Viktor Liu	5b85edb753	[management] Omit proxy_protocol from API response when false (#5656 ) The internal Target model uses a plain bool for ProxyProtocol, which was always serialized to the API response as false even when not configured. Only set the API field when true so it gets omitted via omitempty when unset.	2026-03-23 17:53:17 +01:00
Viktor Liu	b550a2face	[management, proxy] Add require_subdomain capability for proxy clusters (#5628 )	2026-03-20 11:29:50 +01:00
Pascal Fischer	a1858a9cb7	[management] recover proxies after cleanup if heartbeat is still running (#5617 )	2026-03-18 11:48:38 +01:00
Viktor Liu	212b34f639	[management] Add GET /reverse-proxies/clusters endpoint (#5611 )	2026-03-18 11:15:56 +08:00
Viktor Liu	f0eed50678	[management] Accept domain target type for L4 reverse proxy services (#5612 )	2026-03-17 16:29:03 +01:00
Viktor Liu	387e374e4b	[proxy, management] Add header auth, access restrictions, and session idle timeout (#5587 )	2026-03-16 15:22:00 +01:00
Viktor Liu	3e6baea405	[management,proxy,client] Add L4 capabilities (TLS/TCP/UDP) (#5530 )	2026-03-13 18:36:44 +01:00
Zoltan Papp	fe9b844511	[client] refactor auto update workflow (#5448 ) Auto-update logic moved out of the UI into a dedicated updatemanager.Manager service that runs in the connection layer. The UI no longer polls or checks for updates independently. The update manager supports three modes driven by the management server's auto-update policy: No policy set by mgm: checks GitHub for the latest version and notifies the user (previous behavior, now centralized) mgm enforces update: the "About" menu triggers installation directly instead of just downloading the file — user still initiates the action mgm forces update: installation proceeds automatically without user interaction updateManager lifecycle is now owned by daemon, giving the daemon server direct control via a new TriggerUpdate RPC Introduces EngineServices struct to group external service dependencies passed to NewEngine, reducing its argument count from 11 to 4	2026-03-13 17:01:28 +01:00
Pascal Fischer	d86875aeac	[management] Exclude proxy from peer approval (#5588 )	2026-03-13 15:01:59 +01:00
Pascal Fischer	e50e124e70	[proxy] Fix domain switching update (#5585 )	2026-03-12 17:12:26 +01:00
Vlad	b5489d4986	[management] set components network map by default and optimize memory usage (#5575 ) * Network map now defaults to compacted mode at startup; environment parsing issues yield clearer warnings and disabling compacted mode is logged. * Bug Fixes * DNS enablement and nameserver selection now correctly respect group membership, reducing incorrect DNS assignments. * Refactor * Internal routing and firewall rule generation streamlined for more consistent rule IDs and safer peer handling. * Performance * Minor memory and slice allocation improvements for peer/group processing.	2026-03-11 18:19:17 +01:00
Pascal Fischer	5585adce18	[management] add activity events for domains (#5548 ) * add activity events for domains * fix test * update activity codes * update activity codes	2026-03-09 19:04:04 +01:00
Pascal Fischer	f884299823	[proxy] refactor metrics and add usage logs (#5533 ) * New Features * Access logs now include bytes_upload and bytes_download (API and schemas updated, fields required). * Certificate issuance duration is now recorded as a metric. * Refactor * Metrics switched from Prometheus client to OpenTelemetry-backed meters; health endpoint now exposes OpenMetrics via OTLP exporter. * Tests * Metric tests updated to use OpenTelemetry Prometheus exporter and MeterProvider.	2026-03-09 18:45:45 +01:00
Pascal Fischer	11eb725ac8	[management] only count login request duration for successful logins (#5545 )	2026-03-09 14:56:46 +01:00
Pascal Fischer	30c02ab78c	[management] use the cache for the pkce state (#5516 )	2026-03-09 12:23:06 +01:00
Pascal Fischer	5c20f13c48	[management] fix domain uniqueness (#5529 )	2026-03-07 10:46:37 +01:00
Pascal Fischer	e6587b071d	[management] use realip for proxy registration (#5525 )	2026-03-06 16:11:44 +01:00

1 2 3

114 Commits