You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -32,7 +32,7 @@ Users rely on toFQDN policies to enforce network policies against traffic to des
32
32
33
33
### Overview
34
34
35
-
There are two parts to enforcing toFQDN network policy. L4 policy enforcement against IP addresses resolved from an FQDN and policy enforcement on DNS requests (L7 DNS policy). To enforce L4 policy, per endpoint policy bpf maps need to be updated. We'd like to avoid multiple processes writing entries to policy maps, so the standalone DNS proxy (SDP) needs a mechanism to notify agent of newly resolved FQDN <> IP address mappings. This CFP proposes exposing a new gRPC streaming API from the cilium agent. Since the connection is bi-directional, the cilium agent can reuse the same connection to notify the SDP of L7 DNS policy changes.
35
+
There are two parts to enforcing toFQDN network policy. L3/L4 policy enforcement against IP addresses resolved from an FQDN and policy enforcement on DNS requests (L7 DNS policy). To enforce L3/L4 policy, per endpoint policy bpf maps need to be updated. We'd like to avoid multiple processes writing entries to policy maps, so the standalone DNS proxy (SDP) needs a mechanism to notify agent of newly resolved FQDN <> IP address mappings. This CFP proposes exposing a new gRPC streaming API from the cilium agent. Since the connection is bi-directional, the cilium agent can reuse the same connection to notify the SDP of L7 DNS policy changes.
36
36
37
37
Additionally, SDP needs to translate the IP address to cilium identity to enforce the policy. Our proposal involves retrieving the identity mapping from the cilium_ipcache BPF map. Currently L7 proxy (envoy) relies on accessing ipcache directly as well. We aren't aware of any efforts to introduce an abstraction to avoid reading bpf maps owned by the cilium agent beyond the agent process. If / when such abstraction is introduced, SDP can also be updated to implement a similar mechanism. We brainstormed a few options on how the API might look like if we exchange IP to identity mappings via the API as well, but it brings in a lot of additional complexity to keep the mappings in sync as endpoints churn. This CFP will focus on the contract between SDP and Cilium agent to exchange minimum information for implementing the high availability mode.
38
38
@@ -50,7 +50,8 @@ message FQDNMapping {
50
50
repeated bytes IPS = 2; // Resolved IP addresses
51
51
uint32 TTL = 3;
52
52
uint64 source_identity = 4; // Identity of the client making the DNS request
53
-
int dns_response_code = 5;
53
+
bytes source_ip = 5; // IP address of the client making the DNS request
54
+
int dns_response_code = 6;
54
55
}
55
56
```
56
57
Response :
@@ -67,15 +68,14 @@ Request :
67
68
```
68
69
message DNSPolicy {
69
70
uint64 source_identity = 1; // Identity of the workload this L7 DNS policy should apply to
70
-
repeated string dns_pattern = 2; // Allowed DNS pattern this identity is allowed to resolve
71
-
uint64 dns_server_identity = 3; // Identity of destination DNS server
72
-
uint16 dns_server_port = 4;
73
-
uint8 dns_server_proto = 5;
71
+
repeated string dns_pattern = 2; // Allowed DNS pattern this identity is allowed to resolve.
72
+
repeated uint64 dns_server_identity = 3; // Identity of destination DNS server
73
+
repeated uint16 dns_server_port = 4;
74
+
repeated uint8 dns_server_proto = 5;
74
75
}
75
76
76
-
77
77
message DNSPolicies {
78
-
repeated DNSPolicy l7_dns_policy = 1;
78
+
repeated DNSPolicy egress_l7_dns_policy = 1;
79
79
}
80
80
81
81
```
@@ -94,7 +94,7 @@ SDP and agent's DNS proxy will run on the same port using SO_REUSEPORT. By defau
94
94
95
95
### High Level Information Flow
96
96
97
-
* Agent starts up with gRPC streaming service.
97
+
* Agent starts up with gRPC streaming service (only after resources are synced from k8s and ipcache bpf map is populated)
98
98
* SDP starts up.
99
99
* Connects to gRPC service, retrying periodically until success.
100
100
* Agent sends current snapshot for L7 DNS Policy enforcement via UpdatesDNSRules to SDP.
@@ -106,4 +106,12 @@ SDP and agent's DNS proxy will run on the same port using SO_REUSEPORT. By defau
106
106
* Check against policy snapshot if this identity is allowed to resolve the current DNS name and is allowed to talk to DNS server target identity (also needs lookup).
107
107
* Make upstream DNS request from SDP.
108
108
* On response, SDP invokes UpdatesMappings() to notify agent of new mappings.
109
-
* Release DNS response after success from UpdatesMappings() / timeout.
109
+
* Release DNS response after success from UpdatesMappings() / timeout.
110
+
111
+
### Handling SDP <> Agent re-connections
112
+
113
+
* When the agent is unavailable, SDP will periodically attempt to re-connect to the streaming service. Any FQDN<>IP mappings resolved when the agent is down will be cached in SDP and `UpdatesMappings` will be retried after establishing the connection.
114
+
* A new bpf map for ipcache is populated on agent startup, so SDP needs to re-open the ipcache bpf map when the connection is re-established. See https://github.com/cilium/cilium/pull/32864 for similar handling in envoy.
115
+
* On a new connection from SDP, the agent will invoke `UpdatesDNSRules` to notify SDP of all L7 DNS policy rules.
116
+
117
+
* SDP will not listen on the DNS proxy port until a connection is established with cilium agent and initial L7 DNS policy rules are received. Meanwhile, built-in DNS proxy will continue to serve requests. SDP relies on cilium agent for initial bootstrap. In future, we could make SDP retrieve initial policy information from other sources, but this is not in scope for this CFP.
0 commit comments