Skip to content

Commit 811a464

Browse files
committed
Addressing feedback - part 1
Signed-off-by: Hemanth Malla <[email protected]>
1 parent babca39 commit 811a464

File tree

1 file changed

+19
-11
lines changed

1 file changed

+19
-11
lines changed

cilium/CFP-30984-dns-proxy-ha-v2.md

+19-11
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@
66

77
**Cilium Release:** 1.17
88

9-
**Authors:** Hemanth Malla <[email protected]>, Vipul Singh <[email protected]>
9+
**Authors:** Hemanth Malla <[email protected]>, Vipul Singh <[email protected]>, Tamilmani Manoharan <[email protected]>
1010

1111
## Summary
1212

@@ -32,7 +32,7 @@ Users rely on toFQDN policies to enforce network policies against traffic to des
3232

3333
### Overview
3434

35-
There are two parts to enforcing toFQDN network policy. L4 policy enforcement against IP addresses resolved from an FQDN and policy enforcement on DNS requests (L7 DNS policy). To enforce L4 policy, per endpoint policy bpf maps need to be updated. We'd like to avoid multiple processes writing entries to policy maps, so the standalone DNS proxy (SDP) needs a mechanism to notify agent of newly resolved FQDN <> IP address mappings. This CFP proposes exposing a new gRPC streaming API from the cilium agent. Since the connection is bi-directional, the cilium agent can reuse the same connection to notify the SDP of L7 DNS policy changes.
35+
There are two parts to enforcing toFQDN network policy. L3/L4 policy enforcement against IP addresses resolved from an FQDN and policy enforcement on DNS requests (L7 DNS policy). To enforce L3/L4 policy, per endpoint policy bpf maps need to be updated. We'd like to avoid multiple processes writing entries to policy maps, so the standalone DNS proxy (SDP) needs a mechanism to notify agent of newly resolved FQDN <> IP address mappings. This CFP proposes exposing a new gRPC streaming API from the cilium agent. Since the connection is bi-directional, the cilium agent can reuse the same connection to notify the SDP of L7 DNS policy changes.
3636

3737
Additionally, SDP needs to translate the IP address to cilium identity to enforce the policy. Our proposal involves retrieving the identity mapping from the cilium_ipcache BPF map. Currently L7 proxy (envoy) relies on accessing ipcache directly as well. We aren't aware of any efforts to introduce an abstraction to avoid reading bpf maps owned by the cilium agent beyond the agent process. If / when such abstraction is introduced, SDP can also be updated to implement a similar mechanism. We brainstormed a few options on how the API might look like if we exchange IP to identity mappings via the API as well, but it brings in a lot of additional complexity to keep the mappings in sync as endpoints churn. This CFP will focus on the contract between SDP and Cilium agent to exchange minimum information for implementing the high availability mode.
3838

@@ -50,7 +50,8 @@ message FQDNMapping {
5050
repeated bytes IPS = 2; // Resolved IP addresses
5151
uint32 TTL = 3;
5252
uint64 source_identity = 4; // Identity of the client making the DNS request
53-
int dns_response_code = 5;
53+
bytes source_ip = 5; // IP address of the client making the DNS request
54+
int dns_response_code = 6;
5455
}
5556
```
5657
Response :
@@ -67,15 +68,14 @@ Request :
6768
```
6869
message DNSPolicy {
6970
uint64 source_identity = 1; // Identity of the workload this L7 DNS policy should apply to
70-
repeated string dns_pattern = 2; // Allowed DNS pattern this identity is allowed to resolve
71-
uint64 dns_server_identity = 3; // Identity of destination DNS server
72-
uint16 dns_server_port = 4;
73-
uint8 dns_server_proto = 5;
71+
repeated string dns_pattern = 2; // Allowed DNS pattern this identity is allowed to resolve.
72+
repeated uint64 dns_server_identity = 3; // Identity of destination DNS server
73+
repeated uint16 dns_server_port = 4;
74+
repeated uint8 dns_server_proto = 5;
7475
}
7576
76-
7777
message DNSPolicies {
78-
repeated DNSPolicy l7_dns_policy = 1;
78+
repeated DNSPolicy egress_l7_dns_policy = 1;
7979
}
8080
8181
```
@@ -94,7 +94,7 @@ SDP and agent's DNS proxy will run on the same port using SO_REUSEPORT. By defau
9494

9595
### High Level Information Flow
9696

97-
* Agent starts up with gRPC streaming service.
97+
* Agent starts up with gRPC streaming service (only after resources are synced from k8s and ipcache bpf map is populated)
9898
* SDP starts up.
9999
* Connects to gRPC service, retrying periodically until success.
100100
* Agent sends current snapshot for L7 DNS Policy enforcement via UpdatesDNSRules to SDP.
@@ -106,4 +106,12 @@ SDP and agent's DNS proxy will run on the same port using SO_REUSEPORT. By defau
106106
* Check against policy snapshot if this identity is allowed to resolve the current DNS name and is allowed to talk to DNS server target identity (also needs lookup).
107107
* Make upstream DNS request from SDP.
108108
* On response, SDP invokes UpdatesMappings() to notify agent of new mappings.
109-
* Release DNS response after success from UpdatesMappings() / timeout.
109+
* Release DNS response after success from UpdatesMappings() / timeout.
110+
111+
### Handling SDP <> Agent re-connections
112+
113+
* When the agent is unavailable, SDP will periodically attempt to re-connect to the streaming service. Any FQDN<>IP mappings resolved when the agent is down will be cached in SDP and `UpdatesMappings` will be retried after establishing the connection.
114+
* A new bpf map for ipcache is populated on agent startup, so SDP needs to re-open the ipcache bpf map when the connection is re-established. See https://github.com/cilium/cilium/pull/32864 for similar handling in envoy.
115+
* On a new connection from SDP, the agent will invoke `UpdatesDNSRules` to notify SDP of all L7 DNS policy rules.
116+
117+
* SDP will not listen on the DNS proxy port until a connection is established with cilium agent and initial L7 DNS policy rules are received. Meanwhile, built-in DNS proxy will continue to serve requests. SDP relies on cilium agent for initial bootstrap. In future, we could make SDP retrieve initial policy information from other sources, but this is not in scope for this CFP.

0 commit comments

Comments
 (0)