* [client] Fix flow client Receive retry loop not stopping after Close
Use backoff.Permanent for canceled gRPC errors so Receive returns
immediately instead of retrying until context deadline when the
connection is already closed. Add TestNewClient_PermanentClose to
verify the behavior.
The connectivity.Shutdown check was meaningless because when the connection is
shut down, c.realClient.Events(ctx, grpc.WaitForReady(true)) on the next line
already fails with codes.Canceled — which is now handled as a permanent error.
The explicit state check was just duplicating what gRPC already reports
through its normal error path.
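For illustration, a minimal sketch of the pattern, assuming the cenkalti/backoff/v4 package; the names around the receive step are placeholders, not the actual flow client code:

```go
package main

import (
	"context"
	"log"

	"github.com/cenkalti/backoff/v4"
	"google.golang.org/grpc/codes"
	"google.golang.org/grpc/status"
)

// receiveOnce stands in for one attempt to open the Events stream and read from it.
func receiveOnce(ctx context.Context) error {
	return status.Error(codes.Canceled, "grpc: the client connection is closing")
}

func receive(ctx context.Context) error {
	op := func() error {
		err := receiveOnce(ctx)
		if err == nil {
			return nil
		}
		// A Canceled status means the connection is already closed: wrapping it
		// as Permanent makes backoff.Retry return immediately instead of
		// retrying until the context deadline.
		if status.Code(err) == codes.Canceled {
			return backoff.Permanent(err)
		}
		return err // any other error stays retryable
	}
	return backoff.Retry(op, backoff.WithContext(backoff.NewExponentialBackOff(), ctx))
}

func main() {
	if err := receive(context.Background()); err != nil {
		log.Printf("receive stopped: %v", err) // returns after the first attempt
	}
}
```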
* [client] remove WaitForReady from stream open call
grpc.WaitForReady(true) parks the RPC call internally until the
connection reaches READY, only unblocking on ctx cancellation.
This means the external backoff.Retry loop in Receive() never gets
control back during a connection outage — it cannot tick, log, or
apply its retry intervals while WaitForReady is blocking.
Removing it restores fail-fast behaviour: Events() returns immediately
with codes.Unavailable when the connection is not ready, which is
exactly what the backoff loop expects. The backoff becomes the single
authority over retry timing and cadence, as originally intended.
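The call-site change, sketched with a narrowed stand-in for the generated stub (the interface and function names here are illustrative):

```go
package flowsketch

import (
	"context"

	"google.golang.org/grpc"
	"google.golang.org/grpc/codes"
	"google.golang.org/grpc/status"
)

// eventsOpener abstracts the generated stub's Events call for this sketch;
// the real method returns a typed stream client.
type eventsOpener interface {
	Events(ctx context.Context, opts ...grpc.CallOption) (grpc.ClientStream, error)
}

// openStream is the body of one retry attempt. With WaitForReady removed,
// a connection that is not READY makes Events fail immediately with
// codes.Unavailable instead of parking the RPC inside gRPC, so the outer
// backoff.Retry loop keeps control of retry timing.
func openStream(ctx context.Context, client eventsOpener) (grpc.ClientStream, error) {
	// previously: client.Events(ctx, grpc.WaitForReady(true))
	stream, err := client.Events(ctx)
	if err != nil && status.Code(err) == codes.Unavailable {
		return nil, err // retryable: exactly what the backoff loop expects
	}
	return stream, err
}
```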
* [client] Add connection recreation and improve flow client error handling
Store gRPC dial options on the client to enable connection recreation
on Internal errors (RST_STREAM/PROTOCOL_ERROR). Treat Unauthenticated,
PermissionDenied, and Unimplemented as permanent failures. Unify mutex
usage and add reconnection logging for better observability.
* [client] Remove Unauthenticated, PermissionDenied, and Unimplemented from permanent error handling
* [client] Fix error handling in Receive to properly re-establish stream and improve reconnection messaging
* Fix test
* [client] Add graceful shutdown handling and test for concurrent Close during Receive
Prevent reconnection attempts after client closure by tracking a `closed` flag. Use `backoff.Permanent` for errors caused by operations on a closed client. Add a test to ensure `Close` does not block when `Receive` is actively running.
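A sketch of the closed-flag idea with illustrative names (not the actual client struct):

```go
package flowsketch

import (
	"errors"
	"sync"

	"github.com/cenkalti/backoff/v4"
)

var errClientClosed = errors.New("flow client is closed")

type client struct {
	mu     sync.Mutex
	closed bool
}

// Close marks the client as closed before tearing down the stream and
// connection, so an in-flight Receive loop stops instead of reconnecting.
func (c *client) Close() error {
	c.mu.Lock()
	c.closed = true
	c.mu.Unlock()
	// ... close stream and gRPC connection ...
	return nil
}

// checkClosed is called from the Receive retry operation; returning a
// Permanent error makes backoff.Retry give up immediately.
func (c *client) checkClosed() error {
	c.mu.Lock()
	defer c.mu.Unlock()
	if c.closed {
		return backoff.Permanent(errClientClosed)
	}
	return nil
}
```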
* [client] Fix connection swap to properly close old gRPC connection
Close the old `grpc.ClientConn` after successfully swapping to a new connection during reconnection.
* [client] Reset backoff
* [client] Ensure stream closure on error during initialization
* [client] Add test for handling server-side stream closure and reconnection
Introduce `TestReceive_ServerClosesStream` to verify the client's ability to recover and process acknowledgments after the server closes the stream. Enhance test server with a controlled stream closure mechanism.
* [client] Add protocol error simulation and enhance reconnection test
Introduce `connTrackListener` to simulate HTTP/2 RST_STREAM with PROTOCOL_ERROR for testing. Refactor and rename `TestReceive_ServerClosesStream` to `TestReceive_ProtocolErrorStreamReconnect` to verify client recovery on protocol errors.
* [client] Update Close error message in test for clarity
* [client] Fine-tune the tests
* [client] Adjust connection tracking in reconnection test
* [client] Wait for Events handler to exit in RST_STREAM reconnection test
Ensure the old `Events` handler exits fully before proceeding in the reconnection test to avoid dropped acknowledgments on a broken stream. Add a `handlerDone` channel to synchronize handler exits.
* [client] Prevent panic on nil connection during Close
* [client] Refactor connection handling to use explicit target tracking
Introduce `target` field to store the gRPC connection target directly, simplifying reconnections and ensuring consistent connection reuse logic.
* [client] Rename `isCancellation` to `isContextDone` and extend handling for `DeadlineExceeded`
Refactor error handling to include `DeadlineExceeded` scenarios alongside `Canceled`. Update related condition checks for consistency.
* [client] Add connection generation tracking to prevent stale reconnections
Introduce `connGen` to track connection generations and ensure that stale `recreateConnection` calls do not override newer connections. Update stream establishment and reconnection logic to incorporate generation validation.
* [client] Add backoff reset condition to prevent short-lived retry cycles
Refine backoff reset logic to ensure it only occurs for sufficiently long-lived stream connections, avoiding interference with `MaxElapsedTime`.
* [client] Introduce `minHealthyDuration` to refine backoff reset logic
Add `minHealthyDuration` constant to ensure stream retries only reset the backoff timer if the stream survives beyond a minimum duration. Prevents unhealthy, short-lived streams from interfering with `MaxElapsedTime`.
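A sketch of the reset condition; the 30-second threshold here is an assumed value for illustration, not the constant in the client code:

```go
package flowsketch

import (
	"time"

	"github.com/cenkalti/backoff/v4"
)

// minHealthyDuration is the assumed threshold for this sketch: a stream must
// survive at least this long before a failure is allowed to reset the backoff.
const minHealthyDuration = 30 * time.Second

// maybeResetBackoff resets the retry schedule only after a sufficiently
// long-lived stream, so a flapping connection keeps consuming its
// MaxElapsedTime budget instead of retrying forever.
func maybeResetBackoff(bo *backoff.ExponentialBackOff, streamStarted time.Time) {
	if time.Since(streamStarted) >= minHealthyDuration {
		bo.Reset()
	}
}
```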
* [client] IPv6 friendly connection
`parsedURL.Hostname()` strips IPv6 brackets: for `http://[::1]:443` the composed dial target becomes `::1:443`, which is not a valid host:port target for gRPC. Additionally, `fmt.Sprintf("%s:%s", hostname, port)` produces a trailing colon when the URL has no explicit port, so `http://example.com` becomes `example.com:`. Both cases break the initial dial and reconnect paths. Use `parsedURL.Host` directly instead.
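A standard-library illustration of both failure modes:

```go
package main

import (
	"fmt"
	"net/url"
)

func main() {
	u, _ := url.Parse("http://[::1]:443")
	fmt.Printf("%s:%s\n", u.Hostname(), u.Port()) // "::1:443", brackets stripped, invalid gRPC target
	fmt.Println(u.Host)                           // "[::1]:443", usable as-is

	u2, _ := url.Parse("http://example.com")
	fmt.Printf("%s:%s\n", u2.Hostname(), u2.Port()) // "example.com:", trailing colon, URL has no port
	fmt.Println(u2.Host)                            // "example.com"
}
```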
* [client] Add `handlerStarted` channel to synchronize stream establishment in tests
Introduce `handlerStarted` channel in the test server to signal when the server-side handler begins, ensuring robust synchronization between client and server during stream establishment. Update relevant test cases to wait for this signal before proceeding.
* [client] Replace `receivedAcks` map with atomic counter and improve stream establishment sync in tests
Refactor acknowledgment tracking in tests to use an `atomic.Int32` counter instead of a map. Replace fixed sleep with robust synchronization by waiting on `handlerStarted` signal for stream establishment.
* [client] Extract `handleReceiveError` to simplify receive logic
Refactor error handling in `receive` to a dedicated `handleReceiveError` method. Streamlines the main logic and isolates error recovery, including backoff reset and connection recreation.
* [client] recreate gRPC ClientConn on every retry to prevent dual backoff
The flow client had two competing retry loops: our custom exponential
backoff and gRPC's internal subchannel reconnection. When establishStream
failed, the same ClientConn was reused, allowing gRPC's internal backoff
state to accumulate and control dial timing independently.
Changes:
- Consolidate error handling into handleRetryableError, which now
handles context cancellation, permanent errors, backoff reset,
and connection recreation in a single path
- Call recreateConnection on every retryable error so each retry
gets a fresh ClientConn with no internal backoff state
- Remove connGen tracking since Receive is sequential and protected
by a new receiving guard against concurrent calls
- Reduce RandomizationFactor from 1 to 0.5 to avoid near-zero
backoff intervals
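A condensed sketch of the consolidated path described above (struct and helper names are illustrative, and the receiving guard against concurrent calls is omitted):

```go
package flowsketch

import (
	"context"
	"time"

	"github.com/cenkalti/backoff/v4"
	"google.golang.org/grpc"
	"google.golang.org/grpc/codes"
	"google.golang.org/grpc/status"
)

const minHealthyDuration = 30 * time.Second // assumed threshold, as in the earlier sketch

type client struct {
	ctx           context.Context
	target        string
	dialOpts      []grpc.DialOption
	conn          *grpc.ClientConn
	bo            *backoff.ExponentialBackOff
	streamStarted time.Time
}

// handleRetryableError is the single exit path for stream failures: stop on
// shutdown, reset the backoff only after a healthy stream, and always rebuild
// the ClientConn so gRPC carries no internal backoff state into the next attempt.
func (c *client) handleRetryableError(err error) error {
	if c.ctx.Err() != nil || status.Code(err) == codes.Canceled {
		return backoff.Permanent(err) // client is shutting down
	}
	if time.Since(c.streamStarted) >= minHealthyDuration {
		c.bo.Reset()
	}
	_ = c.recreateConnection() // if this fails, the next attempt fails and is retried in turn
	return err                 // retryable: backoff.Retry schedules the next attempt
}

// recreateConnection closes the old ClientConn and dials a fresh one, so the
// only retry timer in play is our exponential backoff.
func (c *client) recreateConnection() error {
	if c.conn != nil {
		_ = c.conn.Close()
	}
	conn, err := grpc.NewClient(c.target, c.dialOpts...)
	if err != nil {
		return err
	}
	c.conn = conn
	return nil
}
```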
extraInitialRoutes() was meant to preserve only the fake IP route
(240.0.0.0/8) across TUN rebuilds, but it re-injected any initial
route missing from the current set. When the management server
advertised exit node routes (0.0.0.0/0) that were later filtered
by the route selector, extraInitialRoutes() re-added them, causing
the Android VPN to capture all traffic with no peer to handle it.
Store the fake IP route explicitly and append only that in notify(),
removing the overly broad initial route diffing.
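A sketch of the narrower behaviour with illustrative names (the real notify() path differs):

```go
package routesketch

import "net/netip"

// fakeIPRoute is the only initial route that must survive TUN rebuilds.
var fakeIPRoute = netip.MustParsePrefix("240.0.0.0/8")

// routesForNotify appends the fake IP route to the currently selected routes,
// instead of diffing against all initial routes (which used to re-add
// filtered-out exit node routes such as 0.0.0.0/0).
func routesForNotify(selected []netip.Prefix) []netip.Prefix {
	for _, r := range selected {
		if r == fakeIPRoute {
			return selected
		}
	}
	return append(append([]netip.Prefix(nil), selected...), fakeIPRoute)
}
```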
* Add network map benchmark and correctness test files
* Add tests for network map components correctness and edge cases
* Skip benchmarks in CI and enhance network map test coverage with new helper functions
* Remove legacy network map benchmarks and tests; refactor components-based test coverage for clarity and scalability.
* [relay] Replace net.Conn with context-aware Conn interface for relay transports
Introduce a listener.Conn interface with context-based Read/Write methods,
replacing net.Conn throughout the relay server. This enables proper timeout
propagation (e.g. handshake timeout) without goroutine-based workarounds
and removes unused LocalAddr/SetDeadline methods from WS and QUIC conns.
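A sketch of the interface shape (illustrative; the relay's actual definition may differ in detail):

```go
package listenersketch

import (
	"context"
	"net"
)

// Conn is a context-aware replacement for net.Conn in the relay transports:
// Read and Write take a context, so timeouts such as the handshake deadline
// propagate without SetDeadline calls or watchdog goroutines, and the
// net.Conn methods the relay never used (LocalAddr, SetDeadline, ...) are gone.
type Conn interface {
	Read(ctx context.Context, b []byte) (int, error)
	Write(ctx context.Context, b []byte) (int, error)
	RemoteAddr() net.Addr
	Close() error
}
```

A handshake read can then be bounded with a plain context.WithTimeout around the first Read instead of a deadline plus a helper goroutine.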
* [relay] Refactor Peer context management to ensure proper cleanup
Integrate context creation (`context.WithCancel`) directly in `NewPeer` and remove redundant initialization in `Work`. Add `ctxCancel` calls to ensure context is properly canceled during `Close` operations.
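A minimal lifecycle sketch (the real constructor takes more arguments and Work runs the relay message loop):

```go
package relaysketch

import "context"

type Peer struct {
	ctx       context.Context
	ctxCancel context.CancelFunc
}

// NewPeer creates the context up front so every Peer owns a cancelable
// context from birth, rather than Work lazily initializing one.
func NewPeer(parent context.Context) *Peer {
	ctx, cancel := context.WithCancel(parent)
	return &Peer{ctx: ctx, ctxCancel: cancel}
}

// Work blocks until the peer is closed; the real loop reads and forwards messages.
func (p *Peer) Work() {
	<-p.ctx.Done()
}

// Close cancels the context so Work unwinds and context-bound resources are
// released, even if Work was never started.
func (p *Peer) Close() {
	p.ctxCancel()
	// ... close the underlying connection ...
}
```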
- Add GetSelectedClientRoutes() to the route manager that filters through FilterSelectedExitNodes, returning only active routes instead of all management routes
- Use GetSelectedClientRoutes() in the DNS route checker so a deselected exit node's 0.0.0.0/0 no longer matches upstream DNS IPs; that stale match was what kept the resolver on the
utun-bound socket after exit node deselection
- Initialize iOS DNS server with host DNS fallback addresses (1.1.1.1:53, 1.0.0.1:53) and a permanent root zone handler, matching Android's behavior — without this, unmatched
DNS queries arriving via the 0.0.0.0/0 tunnel route had no handler and were silently dropped
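A sketch of the route check once it receives only selected routes (names are illustrative):

```go
package dnssketch

import "net/netip"

// upstreamGoesThroughTunnel reports whether an upstream DNS server address
// falls inside any *selected* client route. Deselected exit node routes
// (e.g. 0.0.0.0/0) are filtered out before this check, so after deselection
// the resolver switches back to a regular network socket.
func upstreamGoesThroughTunnel(upstream netip.Addr, selectedRoutes []netip.Prefix) bool {
	for _, r := range selectedRoutes {
		if r.Contains(upstream) {
			return true
		}
	}
	return false
}
```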
Update the mgmProber interface to use HealthCheck() instead of the
now-unexported GetServerPublicKey(), aligning with the changes in the
management client API.
* Unexport GetServerPublicKey, add HealthCheck method
Internalize server key fetching into Login, Register,
GetDeviceAuthorizationFlow, and GetPKCEAuthorizationFlow methods,
removing the need for callers to fetch and pass the key separately.
Replace the exported GetServerPublicKey with a HealthCheck() error
method for connection validation, keeping IsHealthy() bool for
non-blocking background monitoring.
Fix test encryption to use correct key pairs (client public key as
remotePubKey instead of server private key).
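A sketch of the resulting probe-facing surface (simplified; method sets are illustrative):

```go
package mgmsketch

// healthAPI is the shape callers see after the change: RPC wrappers such as
// Login fetch the server key internally, and connection validation goes
// through HealthCheck.
type healthAPI interface {
	HealthCheck() error // blocking: validates the management connection
	IsHealthy() bool    // non-blocking: for background monitoring
}

// mgmProber is the narrowed dependency the prober needs after the change.
type mgmProber interface {
	HealthCheck() error
}

func probe(p mgmProber) error {
	return p.HealthCheck()
}
```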
* Refactor `doMgmLogin` to return only error, removing unused response
- DNS resolution broke after deselecting an exit node because the route checker used all client routes (including deselected ones) to decide how to forward upstream DNS
queries
- Added GetSelectedClientRoutes() to the route manager that filters out deselected exit nodes, and switched the DNS route checker to use it
- Confirmed fix via device testing: after deselecting exit node, DNS queries now correctly use a regular network socket instead of binding to the utun interface
* [client] Support embed.Client on Android with netstack mode
embed.Client.Start() calls ConnectClient.Run() which passes an empty
MobileDependency{}. On Android, the engine dereferences nil fields
(IFaceDiscover, NetworkChangeListener, DnsReadyListener) causing panics.
Provide complete no-op stubs so the engine's existing Android code
paths work unchanged — zero modifications to engine.go:
- Add androidRunOverride hook in Run() for Android-specific dispatch
- Add runOnAndroidEmbed() with complete MobileDependency (all stubs)
- Wire default stubs via init() in connect_android_default.go:
noopIFaceDiscover, noopNetworkChangeListener, noopDnsReadyListener
- Forward logPath to c.run()
Tested: embed.Client starts on Android arm64, joins mesh via relay,
discovers peers, localhost proxy works for TCP+UDP forwarding.
* [client] Fix TestServiceParamsPath for Windows path separators
Use filepath.Join in test assertions instead of hardcoded POSIX paths
so the test passes on Windows where filepath.Join uses backslashes.