Merge branch 'main' into feature/tracegen-file-export

Samriddha9619 · web-flow · commit 872ebd9029b4 · 2025-12-05T08:53:34.000+05:30
diff --git a/.github/actions/block-pr-from-main-branch/action.yml b/.github/actions/block-pr-from-main-branch/action.yml
@@ -12,6 +12,8 @@ runs:
         echo "Branch: ${{ github.event.pull_request.head.ref }}"
 
         if [ "${{ github.event.pull_request.head.repo.fork }}" == "true" ] && [ "${{ github.event.pull_request.head.ref }}" == 'main' ]; then
-          echo "PRs from the main branch of forked repositories are not allowed."
+          echo "Error 🛑: PRs from the main branch of forked repositories are not allowed."
+          echo "  Please create a named branch and resubmit the PR."
+          echo "  See https://github.com/jaegertracing/jaeger/blob/main/CONTRIBUTING_GUIDELINES.md#branches"
           exit 1
         fi
diff --git a/CHANGELOG.md b/CHANGELOG.md
@@ -21,6 +21,58 @@ copy from UI changelog
 
 </details>
 
+v1.76.0 / v2.13.0 (2025-12-03)
+-------------------------------
+
+### Backend Changes
+
+#### 🐞 Bug fixes, Minor Improvements
+
+* Fix: register basicauth extension in component factory ([@xenonnn4w](https://github.com/xenonnn4w) in [#7668](https://github.com/jaegertracing/jaeger/pull/7668))
+
+#### 👷 CI Improvements
+
+* Make error message better ([@yurishkuro](https://github.com/yurishkuro) in [#7675](https://github.com/jaegertracing/jaeger/pull/7675))
+* Clean go cache after installing gotip as suggested. ([@Kavish-12345](https://github.com/Kavish-12345) in [#7666](https://github.com/jaegertracing/jaeger/pull/7666))
+* Fix: build test tools with stable go, not gotip ([@Kavish-12345](https://github.com/Kavish-12345) in [#7665](https://github.com/jaegertracing/jaeger/pull/7665))
+
+### 📊 UI Changes
+
+#### 🐞 Bug fixes, Minor Improvements
+
+* Add support for custom ui configuration in development mode ([@Copilot](https://github.com/apps/copilot-swe-agent) in [#3194](https://github.com/jaegertracing/jaeger-ui/pull/3194))
+* Remove duplicate antd dependencies ([@yurishkuro](https://github.com/yurishkuro) in [#3193](https://github.com/jaegertracing/jaeger-ui/pull/3193))
+* Fix css class typo in sidepanel details div ([@Copilot](https://github.com/apps/copilot-swe-agent) in [#3190](https://github.com/jaegertracing/jaeger-ui/pull/3190))
+* Reduce search form field margins for better viewport fit ([@Copilot](https://github.com/apps/copilot-swe-agent) in [#3189](https://github.com/jaegertracing/jaeger-ui/pull/3189))
+* Migrate deepdependencies/header and qualitymetrics/header from nameselector to searchableselect ([@Copilot](https://github.com/apps/copilot-swe-agent) in [#3185](https://github.com/jaegertracing/jaeger-ui/pull/3185))
+* Reorder checkbox before color by dropdown in tracestatisticsheader ([@Copilot](https://github.com/apps/copilot-swe-agent) in [#3184](https://github.com/jaegertracing/jaeger-ui/pull/3184))
+* Feat: add fuzzy search to searchableselect ([@Copilot](https://github.com/apps/copilot-swe-agent) in [#3182](https://github.com/jaegertracing/jaeger-ui/pull/3182))
+* Fix highlighting of the current tab in the main nav bar ([@SimonADW](https://github.com/SimonADW) in [#3183](https://github.com/jaegertracing/jaeger-ui/pull/3183))
+
+#### 🚧 Experimental Features
+
+* Sync themes with antd ([@yurishkuro](https://github.com/yurishkuro) in [#3196](https://github.com/jaegertracing/jaeger-ui/pull/3196))
+* Add dark theme selector ([@yurishkuro](https://github.com/yurishkuro) in [#3192](https://github.com/jaegertracing/jaeger-ui/pull/3192))
+
+#### 👷 CI Improvements
+
+* Add copyright year linter to npm lint command ([@Copilot](https://github.com/apps/copilot-swe-agent) in [#3197](https://github.com/jaegertracing/jaeger-ui/pull/3197))
+* Rename theme variables to match industry practice ([@yurishkuro](https://github.com/yurishkuro) in [#3174](https://github.com/jaegertracing/jaeger-ui/pull/3174))
+* Tweak codecov config ([@yurishkuro](https://github.com/yurishkuro) in [#3169](https://github.com/jaegertracing/jaeger-ui/pull/3169))
+
+#### ⚙️ Refactoring
+
+* Apply theme vars to common/emphasizednode ([@yurishkuro](https://github.com/yurishkuro) in [#3191](https://github.com/jaegertracing/jaeger-ui/pull/3191))
+* Fix ddg minimap border ([@yurishkuro](https://github.com/yurishkuro) in [#3188](https://github.com/jaegertracing/jaeger-ui/pull/3188))
+* Use token vars in common/utils.css ([@yurishkuro](https://github.com/yurishkuro) in [#3187](https://github.com/jaegertracing/jaeger-ui/pull/3187))
+* Apply theme vars to some shared components ([@yurishkuro](https://github.com/yurishkuro) in [#3181](https://github.com/jaegertracing/jaeger-ui/pull/3181))
+* Apply theme vars to search page ([@yurishkuro](https://github.com/yurishkuro) in [#3180](https://github.com/jaegertracing/jaeger-ui/pull/3180))
+* Use theme vars in errormessage & loadingindicator ([@yurishkuro](https://github.com/yurishkuro) in [#3177](https://github.com/jaegertracing/jaeger-ui/pull/3177))
+* Use theme vars in main page and topnav ([@yurishkuro](https://github.com/yurishkuro) in [#3176](https://github.com/jaegertracing/jaeger-ui/pull/3176))
+* Convert last remaining js files to typescript (excluding tests) ([@yurishkuro](https://github.com/yurishkuro) in [#3173](https://github.com/jaegertracing/jaeger-ui/pull/3173))
+* Convert some easy files to typescript ([@yurishkuro](https://github.com/yurishkuro) in [#3167](https://github.com/jaegertracing/jaeger-ui/pull/3167))
+
+
 v1.75.0 / v2.12.0 (2025-11-18)
 -------------------------------
 
diff --git a/CONTRIBUTING_GUIDELINES.md b/CONTRIBUTING_GUIDELINES.md
@@ -37,7 +37,7 @@ and open a pull request (PR).
 We do not assign issues to contributors. It is almost never the case that multiple
 people jump on the same issue, and practice showed that occasionally people who ask
 for an issue to be assigned to them later have a change in priorities and are unable
-to find time to finish it, which leaves the issue in limbo. 
+to find time to finish it, which leaves the issue in limbo.
 So if you have a desire to work on an issue, feel free to mention it in the comment and just submit a PR.
 
 ### Creating a pull request
@@ -49,7 +49,7 @@ If you are new to GitHub's contribution workflow, we recommend the following set
     * After you clone your forked repo, running below command
       ```bash
       git remote -v
-      ``` 
+      ```
       will show `origin`, e.g. `origin git@github.com:{username}/jaeger.git`
     * Add `upstream` remote:
       ```bash
@@ -59,7 +59,7 @@ If you are new to GitHub's contribution workflow, we recommend the following set
       ```bash
       git fetch upstream main
       ```
-    * Repoint your main branch: 
+    * Repoint your main branch:
       ```bash
       git branch --set-upstream-to=upstream/main main
       ```
@@ -70,7 +70,7 @@ Once you're ready to make changes:
   * Commit your changes, making sure **each commit is signed** ([see below](#certificate-of-origin---sign-your-work)):
     ```bash
     git commit -s -m "Your commit message"
-    ``` 
+    ```
   * You do not need to squash the commits, it will happen once the PR is merged into the official repo (but each individual commit must be signed).
   * When satisfied, push the changes. Git will likely ask for upstream destination, so you push commits like this:
     ```bash
@@ -182,5 +182,4 @@ git push --force
 
 ## Branches
 
-Upstream repository should contain only maintenance branches (e.g. `release-1.0`). For feature
-branches use forked repository.
+Before submitting a PR make sure to create a named branch in your forked repository. Our CI will fail if you submit a PR from the `main` branch. If that happens, just create a new branch and re-submit the PR from that branch.
diff --git a/RELEASE.md b/RELEASE.md
@@ -88,9 +88,9 @@ Here are the release managers for future versions with the tentative release dat
 
 | Version | Release Manager | Tentative release date |
 |---------|-----------------|------------------------|
-| 2.13.0  | @joe-elliott    | 3 December  2025       |
 | 2.14.0  | @mahadzaryab1   | 7 January   2026       |
 | 2.15.0  | @jkowall        | 4 February  2026       |
 | 2.16.0  | @yurishkuro     | 5 March     2026       |
 | 2.17.0  | @albertteoh     | 1 April     2026       |
 | 2.18.0  | @pavolloffay    | 6 May       2026       |
+| 2.19.0  | @joe-elliott    | 3 June      2026       |
diff --git a/cmd/jaeger/internal/components.go b/cmd/jaeger/internal/components.go
@@ -7,6 +7,7 @@ import (
 	"github.com/open-telemetry/opentelemetry-collector-contrib/connector/spanmetricsconnector"
 	"github.com/open-telemetry/opentelemetry-collector-contrib/exporter/kafkaexporter"
 	"github.com/open-telemetry/opentelemetry-collector-contrib/exporter/prometheusexporter"
+	"github.com/open-telemetry/opentelemetry-collector-contrib/extension/basicauthextension"
 	"github.com/open-telemetry/opentelemetry-collector-contrib/extension/healthcheckv2extension"
 	"github.com/open-telemetry/opentelemetry-collector-contrib/extension/pprofextension"
 	"github.com/open-telemetry/opentelemetry-collector-contrib/extension/sigv4authextension"
@@ -76,6 +77,7 @@ func (b builders) build() (otelcol.Factories, error) {
 		zpagesextension.NewFactory(),
 
 		// add-ons
+		basicauthextension.NewFactory(),
 		sigv4authextension.NewFactory(),
 		jaegerquery.NewFactory(),
 		jaegerstorage.NewFactory(),
diff --git a/cmd/jaeger/internal/integration/e2e_integration.go b/cmd/jaeger/internal/integration/e2e_integration.go
@@ -17,7 +17,7 @@ import (
 	"github.com/stretchr/testify/require"
 	"go.uber.org/zap"
 	"go.uber.org/zap/zaptest"
-	"gopkg.in/yaml.v3"
+	"go.yaml.in/yaml/v3"
 
 	"github.com/jaegertracing/jaeger/cmd/jaeger/internal/integration/storagecleaner"
 	"github.com/jaegertracing/jaeger/internal/storage/integration"
diff --git a/docs/adr/cassandra-find-traces-duration.md b/docs/adr/cassandra-find-traces-duration.md
@@ -0,0 +1,170 @@
+# Cassandra FindTraceIDs Duration Query Behavior
+
+## Status
+
+Accepted
+
+## Context
+
+The Cassandra spanstore implementation in Jaeger handles trace queries with duration filters (DurationMin/DurationMax) through a separate code path that cannot efficiently intersect with other query parameters like tags or general operation name filters. This behavior differs from other storage backends like Badger and may seem counterintuitive to users.
+
+### Data Model and Cassandra Constraints
+
+Cassandra's data model imposes specific constraints on query patterns. The `duration_index` table is defined with the following schema structure (as referenced in the CQL insertion query in [`internal/storage/v1/cassandra/spanstore/writer.go`](../../internal/storage/v1/cassandra/spanstore/writer.go)):
+
+```cql
+INSERT INTO duration_index(service_name, operation_name, bucket, duration, start_time, trace_id)
+VALUES (?, ?, ?, ?, ?, ?)
+```
+
+This schema uses a composite partition key consisting of `service_name`, `operation_name`, and `bucket` (an hourly time bucket), with `duration` as a clustering column. In Cassandra, **partition keys require equality constraints** in WHERE clauses - you cannot perform range queries or arbitrary intersections across different partition keys efficiently.
+
+### Duration Index Structure
+
+The duration index is bucketed by hour to limit partition size and improve query performance. From [`internal/storage/v1/cassandra/spanstore/writer.go`](../../internal/storage/v1/cassandra/spanstore/writer.go) (line 57):
+
+```go
+durationBucketSize = time.Hour
+```
+
+When a span is indexed, its start time is rounded to the nearest hour bucket (line 231 in writer.go):
+
+```go
+timeBucket := startTime.Round(durationBucketSize)
+```
+
+The indexing function in `indexByDuration` (lines 229-243) creates two index entries per span:
+1. One indexed by service name alone (with empty operation name)
+2. One indexed by both service name and operation name
+
+```go
+indexByOperationName("")                 // index by service name alone
+indexByOperationName(span.OperationName) // index by service name and operation name
+```
+
+### Query Path Implementation
+
+In [`internal/storage/v1/cassandra/spanstore/reader.go`](../../internal/storage/v1/cassandra/spanstore/reader.go), the `findTraceIDs` method (lines 275-301) performs an early return when duration parameters are present:
+
+```go
+func (s *SpanReader) findTraceIDs(ctx context.Context, traceQuery *spanstore.TraceQueryParameters) (dbmodel.UniqueTraceIDs, error) {
+	if traceQuery.DurationMin != 0 || traceQuery.DurationMax != 0 {
+		return s.queryByDuration(ctx, traceQuery)
+	}
+	// ... other query paths
+}
+```
+
+This early return means that when a duration query is detected, **tag filters are not processed**. The duration path uses only `ServiceName`, `OperationName`, the time range (`StartTimeMin`/`StartTimeMax`), and `NumTraces`.
+
+The `queryByDuration` method (lines 333-375) iterates over hourly buckets within the query time range and issues a Cassandra query for each bucket:
+
+```go
+startTimeByHour := traceQuery.StartTimeMin.Round(durationBucketSize)
+endTimeByHour := traceQuery.StartTimeMax.Round(durationBucketSize)
+
+for timeBucket := endTimeByHour; timeBucket.After(startTimeByHour) || timeBucket.Equal(startTimeByHour); timeBucket = timeBucket.Add(-1 * durationBucketSize) {
+	query := s.session.Query(
+		queryByDuration,
+		timeBucket,
+		traceQuery.ServiceName,
+		traceQuery.OperationName,
+		minDurationMicros,
+		maxDurationMicros,
+		traceQuery.NumTraces*limitMultiple)
+	// execute query...
+}
+```
+
+Each query specifies exact values for `bucket`, `service_name`, and `operation_name` (the partition key components), along with a range filter on `duration` (the clustering column). The query definition (lines 51-55) is:
+
+```cql
+SELECT trace_id
+FROM duration_index
+WHERE bucket = ? AND service_name = ? AND operation_name = ? AND duration > ? AND duration < ?
+LIMIT ?
+```
+
+### Why Not Intersect with Other Indices?
+
+Unlike storage backends such as Badger (which can perform hash-joins and arbitrary index intersections), Cassandra's partition-based architecture makes cross-index intersections expensive and impractical:
+
+1. **Partition key constraints**: The duration index requires equality on `(service_name, operation_name, bucket)`. You cannot efficiently query across multiple operations or join with the tag index without scanning many partitions.
+   
+2. **No server-side joins**: Cassandra does not support server-side joins. To intersect duration results with tag results, the client would need to:
+   - Query the duration index for all matching trace IDs
+   - Query the tag index for all matching trace IDs
+   - Perform a client-side intersection
+   
+   This would be inefficient for large result sets and would require fetching potentially many trace IDs over the network.
+
+3. **Hourly bucket iteration**: The duration query already iterates over hourly buckets. Adding tag intersections would multiply the number of queries and result sets to merge.
+
+### Comparison with Badger
+
+The Badger storage backend handles duration queries differently. In [`internal/storage/v1/badger/spanstore/reader.go`](../../internal/storage/v1/badger/spanstore/reader.go) (around line 486), the `FindTraceIDs` method performs duration queries and then uses the results as a filter (`hashOuter`) that can be intersected with other index results:
+
+```go
+if query.DurationMax != 0 || query.DurationMin != 0 {
+	plan.hashOuter = r.durationQueries(plan, query)
+}
+```
+
+Badger uses an embedded key-value store where range scans and in-memory filtering are efficient, allowing it to merge results from multiple indices. This is a fundamental difference from Cassandra's distributed, partition-oriented design.
+
+## Decision
+
+**The Cassandra spanstore will continue to treat duration queries as a separate query path that does not intersect with tag indices or other non-service/operation filters.**
+
+When a `TraceQueryParameters` contains `DurationMin` or `DurationMax`:
+- The query will use the `duration_index` table exclusively
+- `ServiceName` and `OperationName` will be used as partition key components, while `StartTimeMin`/`StartTimeMax` and `NumTraces` determine the buckets scanned and the per-bucket limit
+- Tag filters will be ignored
+- The code will iterate over hourly time buckets within the query time range
+
+This approach is documented in code comments and in this ADR to set proper expectations.
+
+## Consequences
+
+### Positive
+
+1. **Performance**: Duration queries execute efficiently by scanning only relevant Cassandra partitions (scoped to service, operation, and hourly bucket).
+2. **Scalability**: The bucketed partition strategy prevents hot partitions and distributes load across the cluster.
+3. **Simplicity**: The implementation is straightforward and leverages Cassandra's strengths (partition-scoped queries with range filtering on clustering columns).
+
+### Negative
+
+1. **Limited query expressiveness**: Users cannot combine duration filters with tag filters in a single query. They must choose one or the other.
+2. **Expectation mismatch**: Users familiar with other backends (like Badger) may expect duration and tags to be combinable.
+3. **Workarounds required**: Applications that need both duration and tag filtering must:
+   - Issue separate queries (one with duration, one with tags)
+   - Perform client-side intersection of results
+   - Or use a different storage backend that supports combined queries
+
+### Guidance for Users
+
+- **When using Cassandra spanstore**: Do not combine `DurationMin` or `DurationMax` with tag filters. `validateQuery` (lines 227-229 in reader.go) returns `ErrDurationAndTagQueryNotSupported` when both are specified; without that check, the duration path would ignore the tag filters.
+  
+- **For combined filtering needs**: Consider using the Badger backend, or implement client-side filtering by:
+  1. Querying with duration filters to get a candidate set of trace IDs
+  2. Fetching those traces
+  3. Filtering the results by tag values in your application code
+
+- **Query design**: Structure queries to leverage the indices available. Use `ServiceName` and `OperationName` in conjunction with duration queries for best results.
+
+## References
+
+- Implementation files:
+  - [`internal/storage/v1/cassandra/spanstore/reader.go`](../../internal/storage/v1/cassandra/spanstore/reader.go) - Query logic and duration query path
+  - [`internal/storage/v1/cassandra/spanstore/writer.go`](../../internal/storage/v1/cassandra/spanstore/writer.go) - Duration index schema and insertion logic
+  - [`internal/storage/v1/badger/spanstore/reader.go`](../../internal/storage/v1/badger/spanstore/reader.go) - Badger implementation for comparison
+
+- Cassandra documentation:
+  - [Cassandra Data Modeling](https://cassandra.apache.org/doc/latest/data_modeling/index.html)
+  - [CQL Partition Keys and Clustering Columns](https://cassandra.apache.org/doc/latest/cql/ddl.html#partition-key)
+
+- Related code:
+  - `durationIndex` constant (writer.go line 47-50): CQL insert statement
+  - `queryByDuration` constant (reader.go line 51-55): CQL select statement
+  - `durationBucketSize` constant (writer.go line 57): Hourly bucketing
+  - Error `ErrDurationAndTagQueryNotSupported` (reader.go line 77): Validation that prevents combining duration and tag queries
diff --git a/docs/adr/index.md b/docs/adr/index.md
@@ -0,0 +1,11 @@
+# Architecture Decision Records (ADRs)
+
+This directory contains Architecture Decision Records (ADRs) for the Jaeger project. ADRs document important architectural decisions made during the development of Jaeger, including the context, decision, and consequences of each choice.
+
+## What is an ADR?
+
+An Architecture Decision Record (ADR) is a document that captures an important architectural decision made along with its context and consequences. ADRs help teams understand why certain decisions were made and provide historical context for future contributors.
+
+## ADRs in This Repository
+
+- [Cassandra FindTraceIDs Duration Query Behavior](cassandra-find-traces-duration.md) - Explains why duration queries in the Cassandra spanstore use a separate code path and cannot be efficiently combined with other query parameters.
diff --git a/go.mod b/go.mod
@@ -2,7 +2,7 @@ module github.com/jaegertracing/jaeger
 
 go 1.24.6
 
-toolchain go1.25.3
+toolchain go1.25.4
 
 require (
 	github.com/ClickHouse/ch-go v0.69.0
diff --git a/internal/storage/v1/cassandra/spanstore/reader.go b/internal/storage/v1/cassandra/spanstore/reader.go
@@ -273,6 +273,8 @@ func (s *SpanReader) FindTraceIDs(ctx context.Context, traceQuery *spanstore.Tra
 }
 
 func (s *SpanReader) findTraceIDs(ctx context.Context, traceQuery *spanstore.TraceQueryParameters) (dbmodel.UniqueTraceIDs, error) {
+	// See docs/adr/cassandra-find-traces-duration.md for rationale: duration queries use the duration_index
+	// and are handled as a separate path. Other query parameters (like tags) are ignored when duration is specified.
 	if traceQuery.DurationMin != 0 || traceQuery.DurationMax != 0 {
 		return s.queryByDuration(ctx, traceQuery)
 	}
diff --git a/jaeger-ui b/jaeger-ui
@@ -1 +1 @@
-Subproject commit d83cb35c682151485818b0d5bbaead44dddade6a
+Subproject commit 1ceadb6f6fb29774bfaa77a6148499ff7a04902b
diff --git a/scripts/build/docker/debug/Dockerfile b/scripts/build/docker/debug/Dockerfile
diff --git a/scripts/release/notes.py b/scripts/release/notes.py
