
Comment out the verification task for traces transactions count #832

Merged: 1 commit into master on Jul 6, 2025

Conversation

medvedev1088
Member

[2025-07-05, 09:49:31 UTC] {taskinstance.py:1084} INFO - Dependencies all met for <TaskInstance: ethereum_load_dag.verify_traces_transactions_count scheduled__2025-07-04T08:30:00+00:00 [queued]>
[2025-07-05, 09:49:31 UTC] {taskinstance.py:1084} INFO - Dependencies all met for <TaskInstance: ethereum_load_dag.verify_traces_transactions_count scheduled__2025-07-04T08:30:00+00:00 [queued]>
[2025-07-05, 09:49:31 UTC] {taskinstance.py:1280} INFO - 
--------------------------------------------------------------------------------
[2025-07-05, 09:49:31 UTC] {taskinstance.py:1281} INFO - Starting attempt 6 of 6
[2025-07-05, 09:49:31 UTC] {taskinstance.py:1282} INFO - 
--------------------------------------------------------------------------------
[2025-07-05, 09:49:31 UTC] {taskinstance.py:1301} INFO - Executing <Task(BigQueryInsertJobOperator): verify_traces_transactions_count> on 2025-07-04 08:30:00+00:00
[2025-07-05, 09:49:31 UTC] {standard_task_runner.py:55} INFO - Started process 141142 to run task
[2025-07-05, 09:49:31 UTC] {standard_task_runner.py:82} INFO - Running: ['airflow', 'tasks', 'run', 'ethereum_load_dag', 'verify_traces_transactions_count', 'scheduled__2025-07-04T08:30:00+00:00', '--job-id', '3432315', '--raw', '--subdir', 'DAGS_FOLDER/ethereum_load_dag.py', '--cfg-path', '/tmp/tmp1b0mi7gl']
[2025-07-05, 09:49:32 UTC] {standard_task_runner.py:83} INFO - Job 3432315: Subtask verify_traces_transactions_count
[2025-07-05, 09:49:32 UTC] {task_command.py:392} INFO - Running <TaskInstance: ethereum_load_dag.verify_traces_transactions_count scheduled__2025-07-04T08:30:00+00:00 [running]> on host airflow-worker-snwkk
[2025-07-05, 09:49:32 UTC] {taskinstance.py:1509} INFO - Exporting the following env vars:
[email protected]
AIRFLOW_CTX_DAG_OWNER=airflow
AIRFLOW_CTX_DAG_ID=ethereum_load_dag
AIRFLOW_CTX_TASK_ID=verify_traces_transactions_count
AIRFLOW_CTX_EXECUTION_DATE=2025-07-04T08:30:00+00:00
AIRFLOW_CTX_TRY_NUMBER=6
AIRFLOW_CTX_DAG_RUN_ID=scheduled__2025-07-04T08:30:00+00:00
[2025-07-05, 09:49:32 UTC] {base.py:73} INFO - Using connection ID 'google_cloud_default' for task execution.
[2025-07-05, 09:49:32 UTC] {bigquery.py:2691} INFO - Executing: {'query': {'query': "select if(\n(\nselect count(transaction_hash)\nfrom `bigquery-public-data.crypto_ethereum.traces`\nwhere trace_address is null and transaction_hash is not null\n    and date(block_timestamp) <= '2025-07-04'\n) =\n(\nselect count(*)\nfrom `bigquery-public-data.crypto_ethereum.transactions`\nwhere date(block_timestamp) <= '2025-07-04'\n), 1,\ncast((select 'Total number of traces with null address is not equal to transaction count on 2025-07-04') as int64))", 'useLegacySql': False}}'
[2025-07-05, 09:49:32 UTC] {credentials_provider.py:323} INFO - Getting connection using `google.auth.default()` since no key file is defined for hook.
[2025-07-05, 09:49:32 UTC] {bigquery.py:1546} INFO - Inserting job airflow_ethereum_load_dag_verify_traces_transactions_count_2025_07_04T08_30_00_00_00_66151fc41f41a9bb78dbb3b828e0aa56
[2025-07-05, 09:49:45 UTC] {taskinstance.py:1770} ERROR - Task failed with exception
Traceback (most recent call last):
  File "/opt/python3.8/lib/python3.8/site-packages/airflow/providers/google/cloud/operators/bigquery.py", line 2742, in execute
    job.result(timeout=self.result_timeout, retry=self.result_retry)
  File "/opt/python3.8/lib/python3.8/site-packages/google/cloud/bigquery/job/query.py", line 1499, in result
    do_get_result()
  File "/opt/python3.8/lib/python3.8/site-packages/google/cloud/bigquery/job/query.py", line 1489, in do_get_result
    super(QueryJob, self).result(retry=retry, timeout=timeout)
  File "/opt/python3.8/lib/python3.8/site-packages/google/cloud/bigquery/job/base.py", line 728, in result
    return super(_AsyncJob, self).result(timeout=timeout, **kwargs)
  File "/opt/python3.8/lib/python3.8/site-packages/google/api_core/future/polling.py", line 137, in result
    raise self._exception
google.api_core.exceptions.BadRequest: 400 Bad int64 value: Total number of traces with null...

Location: US
Job ID: airflow_ethereum_load_dag_verify_traces_transactions_count_2025_07_04T08_30_00_00_00_66151fc41f41a9bb78dbb3b828e0aa56

[2025-07-05, 09:49:46 UTC] {taskinstance.py:1319} INFO - Marking task as FAILED. dag_id=ethereum_load_dag, task_id=verify_traces_transactions_count, execution_date=20250704T083000, start_date=20250705T094931, end_date=20250705T094946
[2025-07-05, 09:49:46 UTC] {warnings.py:109} WARNING - /opt/python3.8/lib/python3.8/site-packages/airflow/providers/sendgrid/utils/emailer.py:123: DeprecationWarning: Fetching Sendgrid credentials from environment variables will be deprecated in a future release. Please set credentials using a connection instead.
  _post_sendgrid_mail(mail.get(), conn_id)

[2025-07-05, 09:49:46 UTC] {emailer.py:145} INFO - Email with subject Airflow alert: <TaskInstance: ethereum_load_dag.verify_traces_transactions_count scheduled__2025-07-04T08:30:00+00:00 [failed]> is successfully sent to recipients: [{'to': [{'email': '[email protected]'}]}]
[2025-07-05, 09:49:46 UTC] {standard_task_runner.py:100} ERROR - Failed to execute job 3432315 for task verify_traces_transactions_count (400 Bad int64 value: Total number of traces with null...

Location: US
Job ID: airflow_ethereum_load_dag_verify_traces_transactions_count_2025_07_04T08_30_00_00_00_66151fc41f41a9bb78dbb3b828e0aa56
; 141142)
[2025-07-05, 09:49:46 UTC] {local_task_job.py:208} INFO - Task exited with return code 1
[2025-07-05, 09:49:46 UTC] {taskinstance.py:2581} INFO - 0 downstream tasks scheduled from follow-on schedule check

Failing SQL:

select if(
(
select count(transaction_hash)
from `bigquery-public-data.crypto_ethereum.traces`
where trace_address is null and transaction_hash is not null
    and date(block_timestamp) <= '2025-07-04'
) =
(
select count(*)
from `bigquery-public-data.crypto_ethereum.transactions`
where date(block_timestamp) <= '2025-07-04'
), 1,
cast((select 'Total number of traces with null address is not equal to transaction count on 2025-07-04') as int64))
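The query is an assertion written in SQL. When the two counts match, `if` returns 1. When they differ, BigQuery has to evaluate the cast of the message string to int64, which fails and surfaces as the `400 Bad int64 value` error in the log. The execution date appears three times in the query (two filters and the message), so all three should come from one value. Below is a minimal sketch of that idea, assuming a hypothetical helper `build_verify_query` and a plain Python format string. This is not the project's actual templating, and the table names and predicates are copied from the failing SQL.

```python
from datetime import date

# Hypothetical template: the SQL body is copied from the failing query,
# with every occurrence of the date replaced by a single placeholder.
VERIFY_TEMPLATE = """\
select if(
(
select count(transaction_hash)
from `bigquery-public-data.crypto_ethereum.traces`
where trace_address is null and transaction_hash is not null
    and date(block_timestamp) <= '{ds}'
) =
(
select count(*)
from `bigquery-public-data.crypto_ethereum.transactions`
where date(block_timestamp) <= '{ds}'
), 1,
cast((select 'Total number of traces with null address is not equal to transaction count on {ds}') as int64))"""


def build_verify_query(ds: date) -> str:
    """Render the verification query for one execution date (hypothetical helper)."""
    return VERIFY_TEMPLATE.format(ds=ds.isoformat())


# Render the check for the execution date seen in the log.
query = build_verify_query(date(2025, 7, 4))
```

Rendering from one value keeps the date in the error message in step with the date used in both filters, so the failure text always names the day that was actually checked.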

…e load DAG to prevent execution during the build process.
@medvedev1088 medvedev1088 merged commit 27d0fb0 into master Jul 6, 2025
3 checks passed