Commit 4026be3

Merge pull request #41 from hschellman/Nov25-update
Nov25 update - table of contents and new tokens information .
2 parents 21485e4 + 26d6e1e commit 4026be3

66 files changed: +1347 -185 lines changed

_episodes/01-introduction.md

Lines changed: 2 additions & 0 deletions
@@ -12,6 +12,8 @@ keypoints:
 - This tutorial is brought to you by the DUNE Computing Consortium.
 - The goals are to give you the computing basis to work on DUNE.
 ---
+
+{% include 01-introduction.toc.md %}
 ## DUNE Computing Consortium

 The DUNE Computing Consortium works to establish a global computing network that will handle the massive data streams produced by distributing these across the computing grid.

_episodes/01.5-documentation.md

Lines changed: 4 additions & 1 deletion
@@ -11,6 +11,8 @@ keypoints:
 - There is documentation somewhere!
 ---

+{% include 01.5-documentation.toc.md %}
+
 ## Documentation access

 Much of DUNE's computing documentation is public and hosted in github
@@ -77,4 +79,5 @@ Many repositories have wikis or associated dune.github.io pages.

 [Computing FAQ](https://github.com/orgs/DUNE/projects/19)

-Lists of common connection problems and issues with running jobs.
+Lists of common connection problems and issues with running jobs.
+

_episodes/02-storage-spaces.md

Lines changed: 104 additions & 29 deletions
Large diffs are not rendered by default.

_episodes/02.3-cvmfs.md

Lines changed: 1 addition & 0 deletions
@@ -10,6 +10,7 @@ keypoints:
 - CVMFS distributes software and related files without installing them on the target computer (using a VM, Virtual Machine).
 ---

+{% include 02.3-cvmfs.toc.md %}

 ## CVMFS
 **What is CVMFS and why do we need it?**

_episodes/03-data-management.md

Lines changed: 38 additions & 13 deletions
@@ -13,7 +13,9 @@ keypoints:
 - Xrootd allows user to stream data files.
 ---

-#### Session Video
+{% include 03-data-management.toc.md %}
+
+## Session Video

 <!--The session will be captured on video a placed here after the workshop for asynchronous study.-->

@@ -77,7 +79,11 @@ If you want to process data using the full power of DUNE computing, you should t

 ## How to find and access official data

-### What is metacat?
+{% include OfficialDatasets_include.md %}
+
+You can also query the catalogs yourself using [metacat][metacat] and [rucio][rucio] catalogs. Metacat contains information about file content and official datasets, rucio stores the physical location of those files. Files should have entries in both catalogs. Generally you ask metacat first to find the files you want and then ask rucio for their location.
+
+## What is metacat?

 Metacat is a file and dataset catalog - it allows you to search for files and datasets that have particular attributes and understand their provenance, including details on all of their processing steps.
 It also allows for querying jointly the file catalog and the DUNE conditions database.
@@ -94,9 +100,9 @@ DUNE runs multiple experiments (far detectors, protodune-sp, protodune-dp hd-pro

 To find your data you need to specify at the minimum

-- `core.run_type` (the experiment)
+- `core.run_type` (the experiment: fardet-vd, hd-protodune ...)
 - `core.file_type` (mc or detector)
-- `core.data_tier` (the level of processing raw, full-reconstructed, root-tuple)
+- `core.data_tier` (the level of processing raw, full-reconstructed, root-tuple ...)

 and when searching for specific types of data

@@ -145,7 +151,8 @@ First get metacat if you have not already done so
 token authentication.
 {: .callout} -->

-### then do queries to find particular sets of files
+### then do queries to find particular groups of files
+

 ~~~
 metacat query "files from dune:all where core.file_type=detector and core.run_type=hd-protodune and core.data_tier=raw and core.runs[any]=27331 limit 1"
@@ -240,10 +247,9 @@ Total size: 17553648200600 (17.554 TB)
 {: .output}


-
 <!-- To look at all the files in that run you need to use XRootD - **DO NOT TRY TO COPY 4 TB to your local area!!!*** -->

-## Official datasets <a name="Official_Datasets"></a>
+<!-- ## Official datasets <a name="Official_Datasets"></a>

 The production group make official datasets which are sets of files which share important characteristics such as experiment, data_tier, data_stream, processing version and processing configuration.

@@ -335,11 +341,29 @@ fardet-vd:fardet-vd__full-reconstructed__v09_81_00d02__reco2_dunevd10kt_anu_1x8x
 You can also do keyword/value queries like the ones above using the Other tab on the web-based Data Catalog.

 ![Full query search](../fig/otherquery.png){: .image-with-shadow }
+-->
+
+### find out how much data there is in a dataset
+
+Do a query using the `-s` or `--summary` option
+
+~~~
+metacat query -s "files from fardet-vd:fardet-vd__full-reconstructed__v09_81_00d02__reco2_dunevd10kt_anu_1x8x6_3view_30deg_geov3__prodgenie_anu_numu2nue_nue2nutau_dunevd10kt_1x8x6_3view_30deg__out1__v2_official"
+~~~
+{: .language-bash}
+
+~~~
+Files: 20648
+Total size: 34550167782531 (34.550 TB)
+~~~
+{: .output}

+this may take a while as that is a big dataset.

+
 ### What describes a dataset?

-Let's look at the metadata describing that anti-neutrino dataset: the -j means json output
+Let's look at the metadata describing an anti-neutrino dataset: the -j means json output

 ~~~
 metacat dataset show -j fardet-vd:fardet-vd__full-reconstructed__v09_81_00d02__reco2_dunevd10kt_anu_1x8x6_3view_30deg_geov3__prodgenie_anu_numu2nue_nue2nutau_dunevd10kt_1x8x6_3view_30deg__out1__v2_official
@@ -386,7 +410,7 @@ You can use any of those keys to refine dataset searches as we did above. You pr

 ### What files are in that dataset and how do I use them?

-You can either click on a dataset in the web data catalog or:
+You can either locate and click on a dataset in the [web data catalog](https://dune-tech.rice.edu/dunecatalog/) or use the[metacat web interface](https://metacat.fnal.gov:9443/dune_meta_prod/app/gui) or use the command line:

 ~~~
 metacat query "files from fardet-vd:fardet-vd__full-reconstructed__v09_81_00d02__reco2_dunevd10kt_anu_1x8x6_3view_30deg_geov3__prodgenie_anu_numu2nue_nue2nutau_dunevd10kt_1x8x6_3view_30deg__out1__v2_official limit 10"
@@ -398,7 +422,7 @@ will list the first 10 files in that dataset (you probably don't want to list al
 You can also use a similar query in your batch job to get the files you want.


-### Finding those files on disk
+## Finding those files on disk

 To find your files, you need to use [Rucio](#Rucio) directly or give the [justIN](https://dunejustin.fnal.gov/docs/tutorials.dune.md) batch system your query and it will locate them for you.

@@ -417,7 +441,8 @@ export SAM_EXPERIMENT=dune
 -->
 ## Getting file locations using Rucio

-### What is Rucio? <a name="Rucio"></a>
+### What is Rucio?
+<!-- <a name="Rucio"></a> -->
 Rucio is the next-generation Data Replica service and is part of DUNE's new Distributed Data Management (DDM) system that is currently in deployment.
 Rucio has two functions:
 1. A rule-based system to get files to Rucio Storage Elements around the world and keep them there.
@@ -427,7 +452,7 @@ As of the date of the 2025 tutorial:
 - The Rucio client is available in CVMFS and Spack
 - Most DUNE users are now enabled to use it. New users may not automatically be added.

-### You will need to authenticate to use read files
+### You will need to authenticate to read files

 > #### For SL7 use justin to get a token
 {:.callout}
@@ -498,7 +523,7 @@ which the locations of the file on disk and tape. We can use this to copy the f
 > Try to access the file at manchester using the command:
 > ~~~
 > root -l root://meitner.tier2.hep.manchester.ac.uk:1094//cephfs/experiments/dune/RSE/fardet-vd/fd/a6/prodmarley_nue_es_flat_radiological_decay0_dunevd10kt_1x8x14_3view_30deg_20250217T033222Z_gen_004122_supernova_g4stage1_g4stage2_detsim_reco.root
-> _file0->ls
+> _file0->ls()
 > ~~~
 > {: .language-bash}
 {: .challenge}
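
Note on the metacat changes in this file (illustration only, not part of the commit): the query strings in the diff above all follow the same pattern, so a small helper can assemble them from the three required fields. The sketch below is a minimal Python example under stated assumptions. `build_metacat_query` and `parse_summary` are hypothetical helper names, not part of the metacat client; only the query syntax and the two summary output lines are taken from the diff.

```python
import re


def build_metacat_query(run_type, file_type, data_tier,
                        run=None, limit=None, dataset="dune:all"):
    """Assemble a query string in the form shown in the lesson text.

    Hypothetical helper: it only builds the string that would be passed
    to `metacat query "..."`; it does not contact metacat.
    """
    clauses = [
        f"core.file_type={file_type}",
        f"core.run_type={run_type}",
        f"core.data_tier={data_tier}",
    ]
    if run is not None:
        # Optional run selection, as in the example query in the diff.
        clauses.append(f"core.runs[any]={run}")
    query = f"files from {dataset} where " + " and ".join(clauses)
    if limit is not None:
        query += f" limit {limit}"
    return query


def parse_summary(text):
    """Extract (file count, total bytes) from `metacat query -s` style output.

    Assumes the two-line layout shown in the diff:
    'Files: N' and 'Total size: BYTES (X TB)'.
    """
    files = int(re.search(r"Files:\s*(\d+)", text).group(1))
    size = int(re.search(r"Total size:\s*(\d+)", text).group(1))
    return files, size


if __name__ == "__main__":
    print(build_metacat_query("hd-protodune", "detector", "raw",
                              run=27331, limit=1))
    n_files, n_bytes = parse_summary(
        "Files: 20648\nTotal size: 34550167782531 (34.550 TB)\n")
    print(n_files, n_bytes)
```

The resulting string can be pasted into `metacat query "..."` on the command line; checking the summary size first is a cheap way to avoid listing or copying a multi-terabyte dataset by accident.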

_episodes/03.2-UPS.md

Lines changed: 1 addition & 1 deletion
@@ -17,7 +17,7 @@ keypoints:
 > You need to be in the Apptainer to use it.
 > UPS is being replaced by a new [spack][Spack Documentation] system for Alma9. We will be adding a Spack tutorial soon but for now, you need to use SL7/UPS to use the full DUNE code stack.
 >
-> Go back and look at the [SL7/Apptainer]({{ site.baseurl }}/setup.html#SL7_setup) instructions to get an SL7 container for this section.
+> Go back and look at the [SL7/Apptainer]({{ site.baseurl }}/sl7_setup) instructions to get an SL7 container for this section.
 {: .challenge}

 An important requirement for making valid physics results is computational reproducibility. You need to be able to repeat the same calculations on the data and MC and get the same answers every time. You may be asked to produce a slightly different version of a plot for example, and the data that goes into it has to be the same every time you run the program.

_episodes/04-Spack.md

Lines changed: 2 additions & 0 deletions
@@ -10,6 +10,8 @@ keypoints:
 - Spack is a tool to deliver well defined software configurations
 - CVMFS distributes software and related files without installing them on the target computer (using a VM, Virtual Machine).
 ---
+
+{% include 04-Spack.toc.md %}
 ## What is Spack and why do we need it?

 > ## Note

_episodes/05.1-improve-code-efficiency.md

Lines changed: 4 additions & 2 deletions
@@ -10,11 +10,13 @@ keypoints:
 - CPU, memory, and build time optimizations are possible when good code practices are followed.
 ---

-#### Session Video
+## Improve your Code efficiency
+
+### Session Video

 The session will be captured on video a placed here after the workshop for asynchronous study.

-#### Live Notes
+### Live Notes

 <!-- Participants are encouraged to monitor and utilize the [Livedoc for May. 2023](https://docs.google.com/document/d/19XMQqQ0YV2AtR5OdJJkXoDkuRLWv30BnHY9C5N92uYs/edit?usp=sharing) to ask questions and learn. For reference, the [Livedoc from Jan. 2023](https://docs.google.com/document/d/1sgRQPQn1OCMEUHAk28bTPhZoySdT5NUSDnW07aL-iQU/edit?usp=sharing) is provided.
 -->

_extras/Common-Error-Messages.md

Lines changed: 22 additions & 19 deletions
@@ -5,37 +5,40 @@ keypoints:
 - Errors that people report in doing the tutorial
 ---

-- #### `/usr/bin/xauth: unable to write authority file`
-#### `disk quota exceeded error with metacat auth login`
+## Common Error Messages

-These likely means your kerberos ticket was not forwarded and you can't access your home are without it. do a kinit in your terminal session. Or possibly you really have filled your home area.

-- #### `bash: setup: command not found`
+{% include Common-Error-Messages.toc.md %}

-setup is a UPS command. You need to be running in the Apptainer and setup the DUNE ups system - check out the instructions in [SL7 setup]
-({{ site.baseurl }}/sl7_setup)
+### Error: /usr/bin/xauth: unable to write authority file

+These likely means your kerberos ticket was not forwarded and you can't access your home are without it. do a kinit in your terminal session. Or possibly you really have filled your home area.

-- #### `SyntaxError: future feature annotations is not defined`
+### bash: setup: command not found

-This looks like a bad python version, try doing `which python` if it isn't > 3.9 you don't have a modern python version.
+setup is a UPS command. You need to be running in the Apptainer and setup the DUNE ups system - check out the instructions in [SL7 setup]({{ site.baseurl }}/sl7_setup)

-- On SL7 we suggest setting up the dunesw as shown in the example setup. alternatively you can

-~~~
-setup root -v v6_28_12 -q e26:p3915:prof
-~~~
-{: .language-bash}
+### SyntaxError: future feature annotations is not defined

-- On AL9 we suggest loading ROOT which brings in a modern version of python and allows xrootd access to data.
+This looks like a bad python version, try doing `which python` if it isn't > 3.9 you don't have a modern python version.

-~~~
-spack load root@6.28.12
-~~~
-{: .language-bash}
+- On SL7 we suggest setting up the dunesw as shown in the example setup. alternatively you can

+~~~
+setup root -v v6_28_12 -q e26:p3915:prof
+~~~
+{: .language-bash}

-- #### Spack ==> `Error: somecode matches multiple packages
+- On AL9 we suggest loading ROOT which brings in a modern version of python and allows xrootd access to data.
+
+~~~
+spack load root@6.28.12
+~~~
+{: .language-bash}
+
+
+### Spack : Error: somecode matches multiple packages
 ~~~
 Matching packages:
 jhpj2js somecode@6.28.06%gcc@12.2.0 arch=linux-almalinux9-x86_64_v2

_extras/ComputerSetup.md

Lines changed: 10 additions & 6 deletions
@@ -12,13 +12,17 @@ keypoints:
 - It is also something almost all people who get paid to program are expected to know well
 ---

-## 0. Back up your machine
+## Computer setup
+
+{% include ComputerSetup.toc.md %}
+
+### Back up your machine

 We are going to be messing with your operating system at some level so it is extremely wise to do a complete backup of your machine to an external drive right now.

 Also turn off automatic updates. Operating system updates can mess with your setup. Generally, back up before doing updates so you can revert if necessary.

-## 1. Open a unix terminal window
+### Open a unix terminal window

 First figure out how to open a terminal on your system. The Carpentries Shell Training has a [section that explains this][New Shell]

@@ -35,7 +39,7 @@ On Windows it's a bit more complicated as the underlying operating system is not



-## 2. Learn how to use the Unix Shell
+### Learn how to use the Unix Shell

 <!-- First figure out [how to open a terminal on your system][New Shell]
 -->
@@ -47,7 +51,7 @@ It tells you how to start a terminal session in Windows, Mac OSX and Unix system
 Please do that [unix shell tutorial][Unix Shell Basics] to learn about the basic command line.


-## 3. Install an x-windows emulator
+### Install an x-windows emulator

 #### MacOS

@@ -88,7 +92,7 @@ See the information about [Windows]({{ site.baseurl }}/Windows.html) terminal co
 > You should now be ready to go for the ({{ site.baseurl }}/setup)
 {: .callout}

-## Extra - Get a compiler/code editor
+### Extra - Get a code editor

 Although you will mainly be using python to code to begin with, most HEP code is actually C++ and it is good to have access to a C++ compiler. Bonus is that you normally get a good editor as well.

@@ -108,7 +112,7 @@ You can also use vim or emacs if you are old school.
 Likely you should load up the full [Visual Studio][Visual Studio] as it has a nice C++ compiler


-### Useful Links
+## Useful Links

 [HSF Training Center][HSF Training Center]
