Rename the conflicting environment variable LOGLEVEL to LM_EVAL_LOG_LEVEL (#3418)

fxmarty-amd · baberabb · web-flow · commit a7da4568307e · 2025-11-27T02:20:38.000+05:00
* fix log level env variable name

* change to `LMEVAL_LOG_LEVEL`

---------

Co-authored-by: Baber &lt;baber@hey.com&gt;
diff --git a/docs/CONTRIBUTING.md b/docs/CONTRIBUTING.md
@@ -33,6 +33,10 @@ We use [pytest](https://docs.pytest.org/en/latest/) for running unit tests. All
 python -m pytest --showlocals -s -vv -n=auto --ignore=tests/models/test_openvino.py
 ```
 
+## Verbose logging
+
+You can enable verbose logging with the environment variable `LMEVAL_LOG_LEVEL="debug"`.
+
 ## Contributor License Agreement
 
 We ask that new contributors agree to a Contributor License Agreement affirming that EleutherAI has the rights to use your contribution to our library.
diff --git a/docs/new_task_guide.md b/docs/new_task_guide.md
@@ -41,7 +41,7 @@ and rename the folders and YAML file(s) as desired.
 All data downloading and management is handled through the HuggingFace (**HF**) [`datasets`](https://github.com/huggingface/datasets) API. So, the first thing you should do is check to see if your task's dataset is already provided in their catalog [here](https://huggingface.co/datasets). If it's not in there, please consider adding it to their Hub to make it accessible to a wider user base by following their [new dataset guide](https://github.com/huggingface/datasets/blob/main/ADD_NEW_DATASET.md)
 .
 > [!TIP]
-> To test your task, we recommend using verbose logging using `export LOGLEVEL = DEBUG` in your shell before running the evaluation script. This will help you debug any issues that may arise.
+> To test your task, we recommend using verbose logging using `export LMEVAL_LOG_LEVEL="DEBUG"` in your shell before running the evaluation script. This will help you debug any issues that may arise.
 Once you have a HuggingFace dataset prepared for your task, we want to assign our new YAML to use this dataset:
 
 ```yaml
diff --git a/examples/lm-eval-overview.ipynb b/examples/lm-eval-overview.ipynb
@@ -314,61 +314,12 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 4,
+   "execution_count": null,
    "metadata": {
     "id": "LOUHK7PtQfq4"
    },
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "2023-11-29:11:54:55,156 INFO     [utils.py:160] NumExpr defaulting to 2 threads.\n",
-      "2023-11-29 11:54:55.942051: E tensorflow/compiler/xla/stream_executor/cuda/cuda_dnn.cc:9342] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered\n",
-      "2023-11-29 11:54:55.942108: E tensorflow/compiler/xla/stream_executor/cuda/cuda_fft.cc:609] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered\n",
-      "2023-11-29 11:54:55.942142: E tensorflow/compiler/xla/stream_executor/cuda/cuda_blas.cc:1518] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered\n",
-      "2023-11-29 11:54:57.066802: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT\n",
-      "2023-11-29:11:55:00,954 INFO     [__main__.py:132] Verbosity set to INFO\n",
-      "2023-11-29:11:55:11,038 WARNING  [__main__.py:138]  --limit SHOULD ONLY BE USED FOR TESTING.REAL METRICS SHOULD NOT BE COMPUTED USING LIMIT.\n",
-      "2023-11-29:11:55:11,038 INFO     [__main__.py:143] Including path: ./\n",
-      "2023-11-29:11:55:11,046 INFO     [__main__.py:205] Selected Tasks: ['demo_boolq']\n",
-      "2023-11-29:11:55:11,047 WARNING  [evaluator.py:93] generation_kwargs specified through cli, these settings will be used over set parameters in yaml tasks.\n",
-      "2023-11-29:11:55:11,110 INFO     [huggingface.py:120] Using device 'cuda'\n",
-      "config.json: 100% 571/571 [00:00<00:00, 2.87MB/s]\n",
-      "model.safetensors: 100% 5.68G/5.68G [00:32<00:00, 173MB/s]\n",
-      "tokenizer_config.json: 100% 396/396 [00:00<00:00, 2.06MB/s]\n",
-      "tokenizer.json: 100% 2.11M/2.11M [00:00<00:00, 11.6MB/s]\n",
-      "special_tokens_map.json: 100% 99.0/99.0 [00:00<00:00, 555kB/s]\n",
-      "2023-11-29:11:56:18,658 WARNING  [task.py:614] [Task: demo_boolq] metric acc is defined, but aggregation is not. using default aggregation=mean\n",
-      "2023-11-29:11:56:18,658 WARNING  [task.py:626] [Task: demo_boolq] metric acc is defined, but higher_is_better is not. using default higher_is_better=True\n",
-      "Downloading builder script: 100% 30.7k/30.7k [00:00<00:00, 59.0MB/s]\n",
-      "Downloading metadata: 100% 38.7k/38.7k [00:00<00:00, 651kB/s]\n",
-      "Downloading readme: 100% 14.8k/14.8k [00:00<00:00, 37.3MB/s]\n",
-      "Downloading data: 100% 4.12M/4.12M [00:00<00:00, 55.1MB/s]\n",
-      "Generating train split: 100% 9427/9427 [00:00<00:00, 15630.89 examples/s]\n",
-      "Generating validation split: 100% 3270/3270 [00:00<00:00, 20002.56 examples/s]\n",
-      "Generating test split: 100% 3245/3245 [00:00<00:00, 20866.19 examples/s]\n",
-      "2023-11-29:11:56:22,315 INFO     [task.py:355] Building contexts for task on rank 0...\n",
-      "2023-11-29:11:56:22,322 INFO     [evaluator.py:319] Running loglikelihood requests\n",
-      "100% 20/20 [00:04<00:00,  4.37it/s]\n",
-      "fatal: not a git repository (or any of the parent directories): .git\n",
-      "hf (pretrained=EleutherAI/pythia-2.8b), gen_kwargs: (), limit: 10.0, num_fewshot: None, batch_size: 1\n",
-      "|  Tasks   |Version|Filter|n-shot|Metric|Value|   |Stderr|\n",
-      "|----------|-------|------|-----:|------|----:|---|-----:|\n",
-      "|demo_boolq|Yaml   |none  |     0|acc   |    1|±  |     0|\n",
-      "\n"
-     ]
-    }
-   ],
-   "source": [
-    "%env LOGLEVEL=DEBUG\n",
-    "!lm_eval \\\n",
-    "    --model hf \\\n",
-    "    --model_args pretrained=EleutherAI/pythia-2.8b \\\n",
-    "    --include_path ./ \\\n",
-    "    --tasks demo_boolq \\\n",
-    "    --limit 10"
-   ]
+   "outputs": [],
+   "source": "%env LMEVAL_LOG_LEVEL=DEBUG\n!lm_eval \\\n    --model hf \\\n    --model_args pretrained=EleutherAI/pythia-2.8b \\\n    --include_path ./ \\\n    --tasks demo_boolq \\\n    --limit 10"
   },
   {
    "cell_type": "markdown",
@@ -415,64 +366,12 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 6,
+   "execution_count": null,
    "metadata": {
     "id": "XceRKCuuDtbn"
    },
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "2023-11-29:11:56:33,016 INFO     [utils.py:160] NumExpr defaulting to 2 threads.\n",
-      "2023-11-29 11:56:33.852995: E tensorflow/compiler/xla/stream_executor/cuda/cuda_dnn.cc:9342] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered\n",
-      "2023-11-29 11:56:33.853050: E tensorflow/compiler/xla/stream_executor/cuda/cuda_fft.cc:609] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered\n",
-      "2023-11-29 11:56:33.853087: E tensorflow/compiler/xla/stream_executor/cuda/cuda_blas.cc:1518] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered\n",
-      "2023-11-29 11:56:35.129047: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT\n",
-      "2023-11-29:11:56:38,546 INFO     [__main__.py:132] Verbosity set to INFO\n",
-      "2023-11-29:11:56:47,509 WARNING  [__main__.py:138]  --limit SHOULD ONLY BE USED FOR TESTING.REAL METRICS SHOULD NOT BE COMPUTED USING LIMIT.\n",
-      "2023-11-29:11:56:47,509 INFO     [__main__.py:143] Including path: ./\n",
-      "2023-11-29:11:56:47,517 INFO     [__main__.py:205] Selected Tasks: ['yes_or_no_tasks']\n",
-      "2023-11-29:11:56:47,520 WARNING  [evaluator.py:93] generation_kwargs specified through cli, these settings will be used over set parameters in yaml tasks.\n",
-      "2023-11-29:11:56:47,550 INFO     [huggingface.py:120] Using device 'cuda'\n",
-      "2023-11-29:11:57:08,743 WARNING  [task.py:614] [Task: demo_cola] metric acc is defined, but aggregation is not. using default aggregation=mean\n",
-      "2023-11-29:11:57:08,743 WARNING  [task.py:626] [Task: demo_cola] metric acc is defined, but higher_is_better is not. using default higher_is_better=True\n",
-      "Downloading builder script: 100% 28.8k/28.8k [00:00<00:00, 52.7MB/s]\n",
-      "Downloading metadata: 100% 28.7k/28.7k [00:00<00:00, 51.9MB/s]\n",
-      "Downloading readme: 100% 27.9k/27.9k [00:00<00:00, 48.0MB/s]\n",
-      "Downloading data: 100% 377k/377k [00:00<00:00, 12.0MB/s]\n",
-      "Generating train split: 100% 8551/8551 [00:00<00:00, 19744.58 examples/s]\n",
-      "Generating validation split: 100% 1043/1043 [00:00<00:00, 27057.01 examples/s]\n",
-      "Generating test split: 100% 1063/1063 [00:00<00:00, 22705.17 examples/s]\n",
-      "2023-11-29:11:57:11,698 INFO     [task.py:355] Building contexts for task on rank 0...\n",
-      "2023-11-29:11:57:11,704 INFO     [evaluator.py:319] Running loglikelihood requests\n",
-      "100% 20/20 [00:03<00:00,  5.15it/s]\n",
-      "fatal: not a git repository (or any of the parent directories): .git\n",
-      "hf (pretrained=EleutherAI/pythia-2.8b), gen_kwargs: (), limit: 10.0, num_fewshot: None, batch_size: 1\n",
-      "|     Tasks     |Version|Filter|n-shot|Metric|Value|   |Stderr|\n",
-      "|---------------|-------|------|-----:|------|----:|---|-----:|\n",
-      "|yes_or_no_tasks|N/A    |none  |     0|acc   |  0.7|±  |0.1528|\n",
-      "| - demo_cola   |Yaml   |none  |     0|acc   |  0.7|±  |0.1528|\n",
-      "\n",
-      "|    Groups     |Version|Filter|n-shot|Metric|Value|   |Stderr|\n",
-      "|---------------|-------|------|-----:|------|----:|---|-----:|\n",
-      "|yes_or_no_tasks|N/A    |none  |     0|acc   |  0.7|±  |0.1528|\n",
-      "\n"
-     ]
-    }
-   ],
-   "source": [
-    "# !accelerate launch --no_python\n",
-    "%env LOGLEVEL=DEBUG\n",
-    "!lm_eval \\\n",
-    "    --model hf \\\n",
-    "    --model_args pretrained=EleutherAI/pythia-2.8b \\\n",
-    "    --include_path ./ \\\n",
-    "    --tasks yes_or_no_tasks \\\n",
-    "    --limit 10 \\\n",
-    "    --output output/yes_or_no_tasks/ \\\n",
-    "    --log_samples"
-   ]
+   "outputs": [],
+   "source": "# !accelerate launch --no_python\n%env LMEVAL_LOG_LEVEL=DEBUG\n!lm_eval \\\n    --model hf \\\n    --model_args pretrained=EleutherAI/pythia-2.8b \\\n    --include_path ./ \\\n    --tasks yes_or_no_tasks \\\n    --limit 10 \\\n    --output output/yes_or_no_tasks/ \\\n    --log_samples"
   },
   {
    "cell_type": "markdown",
@@ -520,59 +419,12 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 8,
+   "execution_count": null,
    "metadata": {
     "id": "jyKOfCsKb-xy"
    },
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "2023-11-29:11:57:23,598 INFO     [utils.py:160] NumExpr defaulting to 2 threads.\n",
-      "2023-11-29 11:57:24.719750: E tensorflow/compiler/xla/stream_executor/cuda/cuda_dnn.cc:9342] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered\n",
-      "2023-11-29 11:57:24.719806: E tensorflow/compiler/xla/stream_executor/cuda/cuda_fft.cc:609] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered\n",
-      "2023-11-29 11:57:24.719847: E tensorflow/compiler/xla/stream_executor/cuda/cuda_blas.cc:1518] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered\n",
-      "2023-11-29 11:57:26.656125: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT\n",
-      "2023-11-29:11:57:31,563 INFO     [__main__.py:132] Verbosity set to INFO\n",
-      "2023-11-29:11:57:40,541 WARNING  [__main__.py:138]  --limit SHOULD ONLY BE USED FOR TESTING.REAL METRICS SHOULD NOT BE COMPUTED USING LIMIT.\n",
-      "2023-11-29:11:57:40,541 INFO     [__main__.py:143] Including path: ./\n",
-      "2023-11-29:11:57:40,558 INFO     [__main__.py:205] Selected Tasks: ['demo_mmlu_high_school_geography']\n",
-      "2023-11-29:11:57:40,559 WARNING  [evaluator.py:93] generation_kwargs specified through cli, these settings will be used over set parameters in yaml tasks.\n",
-      "2023-11-29:11:57:40,589 INFO     [huggingface.py:120] Using device 'cuda'\n",
-      "Downloading builder script: 100% 5.84k/5.84k [00:00<00:00, 17.7MB/s]\n",
-      "Downloading metadata: 100% 106k/106k [00:00<00:00, 892kB/s] \n",
-      "Downloading readme: 100% 39.7k/39.7k [00:00<00:00, 631kB/s]\n",
-      "Downloading data: 100% 166M/166M [00:01<00:00, 89.0MB/s]\n",
-      "Generating auxiliary_train split: 100% 99842/99842 [00:07<00:00, 12536.83 examples/s]\n",
-      "Generating test split: 100% 198/198 [00:00<00:00, 1439.20 examples/s]\n",
-      "Generating validation split: 100% 22/22 [00:00<00:00, 4181.76 examples/s]\n",
-      "Generating dev split: 100% 5/5 [00:00<00:00, 36.25 examples/s]\n",
-      "2023-11-29:11:58:09,798 INFO     [task.py:355] Building contexts for task on rank 0...\n",
-      "2023-11-29:11:58:09,822 INFO     [evaluator.py:319] Running loglikelihood requests\n",
-      "100% 40/40 [00:05<00:00,  7.86it/s]\n",
-      "fatal: not a git repository (or any of the parent directories): .git\n",
-      "hf (pretrained=EleutherAI/pythia-2.8b), gen_kwargs: (), limit: 10.0, num_fewshot: None, batch_size: 1\n",
-      "|             Tasks             |Version|Filter|n-shot| Metric |Value|   |Stderr|\n",
-      "|-------------------------------|-------|------|-----:|--------|----:|---|-----:|\n",
-      "|demo_mmlu_high_school_geography|Yaml   |none  |     0|acc     |  0.3|±  |0.1528|\n",
-      "|                               |       |none  |     0|acc_norm|  0.3|±  |0.1528|\n",
-      "\n"
-     ]
-    }
-   ],
-   "source": [
-    "# !accelerate launch --no_python\n",
-    "%env LOGLEVEL=DEBUG\n",
-    "!lm_eval \\\n",
-    "    --model hf \\\n",
-    "    --model_args pretrained=EleutherAI/pythia-2.8b \\\n",
-    "    --include_path ./ \\\n",
-    "    --tasks demo_mmlu_high_school_geography \\\n",
-    "    --limit 10 \\\n",
-    "    --output output/mmlu_high_school_geography/ \\\n",
-    "    --log_samples"
-   ]
+   "outputs": [],
+   "source": "# !accelerate launch --no_python\n%env LMEVAL_LOG_LEVEL=DEBUG\n!lm_eval \\\n    --model hf \\\n    --model_args pretrained=EleutherAI/pythia-2.8b \\\n    --include_path ./ \\\n    --tasks demo_mmlu_high_school_geography \\\n    --limit 10 \\\n    --output output/mmlu_high_school_geography/ \\\n    --log_samples"
   },
   {
    "cell_type": "markdown",
@@ -605,51 +457,12 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 10,
+   "execution_count": null,
    "metadata": {
     "id": "-_CVnDirdy7j"
    },
-   "outputs": [
-    {
-     "name": "stdout",
-     "output_type": "stream",
-     "text": [
-      "2023-11-29:11:58:21,284 INFO     [utils.py:160] NumExpr defaulting to 2 threads.\n",
-      "2023-11-29 11:58:22.850159: E tensorflow/compiler/xla/stream_executor/cuda/cuda_dnn.cc:9342] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered\n",
-      "2023-11-29 11:58:22.850219: E tensorflow/compiler/xla/stream_executor/cuda/cuda_fft.cc:609] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered\n",
-      "2023-11-29 11:58:22.850254: E tensorflow/compiler/xla/stream_executor/cuda/cuda_blas.cc:1518] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered\n",
-      "2023-11-29 11:58:24.948103: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT\n",
-      "2023-11-29:11:58:28,460 INFO     [__main__.py:132] Verbosity set to INFO\n",
-      "2023-11-29:11:58:37,935 WARNING  [__main__.py:138]  --limit SHOULD ONLY BE USED FOR TESTING.REAL METRICS SHOULD NOT BE COMPUTED USING LIMIT.\n",
-      "2023-11-29:11:58:37,935 INFO     [__main__.py:143] Including path: ./\n",
-      "2023-11-29:11:58:37,969 INFO     [__main__.py:205] Selected Tasks: ['demo_mmlu_high_school_geography_continuation']\n",
-      "2023-11-29:11:58:37,972 WARNING  [evaluator.py:93] generation_kwargs specified through cli, these settings will be used over set parameters in yaml tasks.\n",
-      "2023-11-29:11:58:38,008 INFO     [huggingface.py:120] Using device 'cuda'\n",
-      "2023-11-29:11:58:59,758 INFO     [task.py:355] Building contexts for task on rank 0...\n",
-      "2023-11-29:11:58:59,777 INFO     [evaluator.py:319] Running loglikelihood requests\n",
-      "100% 40/40 [00:02<00:00, 16.23it/s]\n",
-      "fatal: not a git repository (or any of the parent directories): .git\n",
-      "hf (pretrained=EleutherAI/pythia-2.8b), gen_kwargs: (), limit: 10.0, num_fewshot: None, batch_size: 1\n",
-      "|                   Tasks                    |Version|Filter|n-shot| Metric |Value|   |Stderr|\n",
-      "|--------------------------------------------|-------|------|-----:|--------|----:|---|-----:|\n",
-      "|demo_mmlu_high_school_geography_continuation|Yaml   |none  |     0|acc     |  0.1|±  |0.1000|\n",
-      "|                                            |       |none  |     0|acc_norm|  0.2|±  |0.1333|\n",
-      "\n"
-     ]
-    }
-   ],
-   "source": [
-    "# !accelerate launch --no_python\n",
-    "%env LOGLEVEL=DEBUG\n",
-    "!lm_eval \\\n",
-    "    --model hf \\\n",
-    "    --model_args pretrained=EleutherAI/pythia-2.8b \\\n",
-    "    --include_path ./ \\\n",
-    "    --tasks demo_mmlu_high_school_geography_continuation \\\n",
-    "    --limit 10 \\\n",
-    "    --output output/mmlu_high_school_geography_continuation/ \\\n",
-    "    --log_samples"
-   ]
+   "outputs": [],
+   "source": "# !accelerate launch --no_python\n%env LMEVAL_LOG_LEVEL=DEBUG\n!lm_eval \\\n    --model hf \\\n    --model_args pretrained=EleutherAI/pythia-2.8b \\\n    --include_path ./ \\\n    --tasks demo_mmlu_high_school_geography_continuation \\\n    --limit 10 \\\n    --output output/mmlu_high_school_geography_continuation/ \\\n    --log_samples"
   },
   {
    "cell_type": "markdown",
diff --git a/lm_eval/__main__.py b/lm_eval/__main__.py
@@ -231,7 +231,7 @@ def setup_parser() -> argparse.ArgumentParser:
         type=str.upper,
         default=None,
         metavar="CRITICAL|ERROR|WARNING|INFO|DEBUG",
-        help="(Deprecated) Controls logging verbosity level. Use the `LOGLEVEL` environment variable instead. Set to DEBUG for detailed output when testing or adding new task configurations.",
+        help="(Deprecated) Controls logging verbosity level. Use the `LMEVAL_LOG_LEVEL` environment variable instead. Set to DEBUG for detailed output when testing or adding new task configurations.",
     )
     parser.add_argument(
         "--wandb_args",
diff --git a/lm_eval/utils.py b/lm_eval/utils.py
@@ -58,7 +58,7 @@ def format(self, record):
         datefmt="%Y-%m-%d:%H:%M:%S",
     )
 
-    log_level = os.environ.get("LOGLEVEL", verbosity) or verbosity
+    log_level = os.environ.get("LMEVAL_LOG_LEVEL", verbosity) or verbosity
 
     level_map = {
         "DEBUG": logging.DEBUG,

Original file line number	Diff line number	Diff line change
`@@ -41,7 +41,7 @@ and rename the folders and YAML file(s) as desired.`
`41`	`41`	All data downloading and management is handled through the HuggingFace (HF) [`datasets`](https://github.com/huggingface/datasets) API. So, the first thing you should do is check to see if your task's dataset is already provided in their catalog [here](https://huggingface.co/datasets). If it's not in there, please consider adding it to their Hub to make it accessible to a wider user base by following their [new dataset guide](https://github.com/huggingface/datasets/blob/main/ADD_NEW_DATASET.md)
`42`	`42`	`.`
`43`	`43`	`> [!TIP]`
`44`		-> To test your task, we recommend using verbose logging using `export LOGLEVEL = DEBUG` in your shell before running the evaluation script. This will help you debug any issues that may arise.
	`44`	+> To test your task, we recommend using verbose logging using `export LMEVAL_LOG_LEVEL="DEBUG"` in your shell before running the evaluation script. This will help you debug any issues that may arise.
`45`	`45`	`Once you have a HuggingFace dataset prepared for your task, we want to assign our new YAML to use this dataset:`
`46`	`46`
`47`	`47`	```yaml
Original file line number	Diff line number	Diff line change
`@@ -231,7 +231,7 @@ def setup_parser() -> argparse.ArgumentParser:`
`231`	`231`	`type=str.upper,`
`232`	`232`	`default=None,`
`233`	`233`	`metavar="CRITICAL\|ERROR\|WARNING\|INFO\|DEBUG",`
`234`		- help="(Deprecated) Controls logging verbosity level. Use the `LOGLEVEL` environment variable instead. Set to DEBUG for detailed output when testing or adding new task configurations.",
	`234`	+ help="(Deprecated) Controls logging verbosity level. Use the `LMEVAL_LOG_LEVEL` environment variable instead. Set to DEBUG for detailed output when testing or adding new task configurations.",
`235`	`235`	`)`
`236`	`236`	`parser.add_argument(`
`237`	`237`	`"--wandb_args",`
Original file line number	Diff line number	Diff line change
`@@ -58,7 +58,7 @@ def format(self, record):`
`58`	`58`	`datefmt="%Y-%m-%d:%H:%M:%S",`
`59`	`59`	`)`
`60`	`60`
`61`		`- log_level = os.environ.get("LOGLEVEL", verbosity) or verbosity`
	`61`	`+ log_level = os.environ.get("LMEVAL_LOG_LEVEL", verbosity) or verbosity`
`62`	`62`
`63`	`63`	`level_map = {`
`64`	`64`	`"DEBUG": logging.DEBUG,`