You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -59,8 +59,8 @@ The benchmark framework relies on the following inputs produced by the [SNB Data
59
59
60
60
### Driver modes
61
61
62
-
For each implementation, it is possible to perform to perform the run in one of the [SNB driver's](https://github.com/ldbc/ldbc_snb_interactive_driver) three modes.
63
-
All three should be started withe the initial data set loaded to the database.
62
+
For each implementation, it is possible to perform to perform the run in one of the [SNB driver's](https://github.com/ldbc/ldbc_snb_interactive_driver) three modes: create validation parameters, validate, and benchmark.
63
+
The execution in all three modes should be started after the initial data set was loaded into the system under test.
64
64
65
65
1. Create validation parameters with the `driver/create-validation-parameters.sh` script.
66
66
@@ -71,7 +71,7 @@ All three should be started withe the initial data set loaded to the database.
71
71
***Output:** The results will be stored in the validation parameters file (e.g. `validation_params.csv`) file set in the `create_validation_parameters` configuration property.
72
72
***Parallelism:** The execution must be single-threaded to ensure a deterministic order of operations.
73
73
74
-
2. Validate against existing validation parameters with the `driver/validate.sh` script.
74
+
2. Validate against an existing reference output (called "validation parameters") with the `driver/validate.sh` script.
75
75
76
76
***Input:**
77
77
* The query substitution parameters are taken from the validation parameters file (e.g. `validation_params.csv`) file set in the `validate_database` configuration property.
@@ -82,7 +82,7 @@ All three should be started withe the initial data set loaded to the database.
82
82
* If the validation failed, the results are saved to the `validation_params-failed-expected.json` and `validation_params-failed-actual.json` files.
83
83
***Parallelism:** The execution must be single-threaded to ensure a deterministic order of operations.
84
84
85
-
Pre-generated [validation data sets for SF0.1 to SF10](https://pub-383410a98aef4cb686f0c7601eddd25f.r2.dev/interactive-v1/validation_params-sf0.1-sf10.tar.zst) are available.
85
+
Pre-generated [validation parameters for SF0.1 to SF10](https://pub-383410a98aef4cb686f0c7601eddd25f.r2.dev/interactive-v1/validation_params-sf0.1-sf10.tar.zst) are available.
86
86
87
87
3. Run the benchmark with the `driver/benchmark.sh` script.
88
88
@@ -100,7 +100,7 @@ All three should be started withe the initial data set loaded to the database.
100
100
* The detailed results of the benchmark are printed to the console and saved in the `results/` directory.
101
101
***Parallelism:** Multi-threaded execution is recommended to achieve the best result.
102
102
103
-
For more details on validating and benchmarking, visit the [driver wiki](https://github.com/ldbc/ldbc_snb_interactive_driver/wiki).
103
+
For more details on validating and benchmarking, visit the [driver's documentation](https://github.com/ldbc/ldbc_snb_interactive_driver/tree/v1-dev/docs).
An example configuration for scale factor 1 is given in the [`params-csv-composite-longdateformatter.ini`](https://github.com/ldbc/ldbc_snb_datagen_hadoop/blob/main/params-csv-composite-longdateformatter.ini) file of the Datagen repository.
37
37
38
-
### Preprocessing and loading
38
+
##Running the benchmark
39
39
40
40
Set the following environment variables based on your data source and where you would like to store the converted CSVs:
This preprocesses the CSVs in `${NEO4J_VANILLA_CSV_DIR}` and places the resulting CSVs in `${NEO4J_CONVERTED_CSV_DIR}`, stops any running Neo4j database instances, loads the database and starts it.
56
56
57
-
## Running the benchmark
57
+
###Running the benchmark driver
58
58
59
-
To run the scripts of benchmark framework, edit the `driver/{create-validation-parameters,validate,benchmark}.properties` files, then run their script, one of:
59
+
The instructions below explain how to run the benchmark driver in one of the three modes (createvalidationparameters,validate,benchmark). For more details on the driver modes, check the ["Driver modes" section of the main README](../README.md#driver-modes).
60
60
61
-
```bash
62
-
driver/create-validation-parameters.sh
63
-
driver/validate.sh
64
-
driver/benchmark.sh
65
-
```
61
+
#### Create validation parameters
62
+
63
+
1. Edit the `driver/benchmark.properties` file. Make sure that the `ldbc.snb.interactive.scale_factor`, `ldbc.snb.interactive.updates_dir`, `ldbc.snb.interactive.parameters_dir` properties are set correctly and are in sync.
64
+
65
+
2. Run the script:
66
+
67
+
```bash
68
+
driver/create-validation-parameters.sh
69
+
```
70
+
71
+
#### Validate
72
+
73
+
1. Edit the `driver/validate.properties` file. Make sure that the `validate_database` property points to the file you would like to validate against.
74
+
75
+
2. Run the script:
76
+
77
+
```bash
78
+
driver/validate.sh
79
+
```
80
+
81
+
#### Benchmark
82
+
83
+
1. Edit the `driver/benchmark.properties` file. Make sure that the `ldbc.snb.interactive.scale_factor`, `ldbc.snb.interactive.updates_dir`, and `ldbc.snb.interactive.parameters_dir` properties are set correctly and are in sync.
84
+
85
+
2. Run the script:
86
+
87
+
```bash
88
+
driver/benchmark.sh
89
+
```
90
+
91
+
#### Reload between runs
66
92
67
93
:warning: The default workload contains updates which are persisted in the database. Therefore, **the database needs to be reloaded or restored from backup before each run**. Use the provided `scripts/backup-database.sh` and `scripts/restore-database.sh` scripts to achieve this. Alternatively, e.g. if you lack sudo rights, use Neo4j's built-in dump and load features through the `scripts/backup-neo4j.sh` and `scripts/restore-neo4j.sh` scripts.
Copy file name to clipboardExpand all lines: duckdb/README.md
+43-10Lines changed: 43 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -10,27 +10,60 @@ Grab DuckDB:
10
10
scripts/get.sh
11
11
```
12
12
13
-
## Generating and loading the data set
14
-
15
-
### Generating the data set
13
+
## Generating the data set
16
14
17
15
The data sets need to be generated before loading it to the database. No preprocessing is required. To generate the data sets for DuckDB, use the same settings as for PostgreSQL, i.e. the [Hadoop-based Datagen](https://github.com/ldbc/ldbc_snb_datagen_hadoop)'s `CsvMergeForeign` serializer classes.
18
16
19
-
### Loading the data set
17
+
## Running the benchmark
18
+
19
+
Set the following environment variable based on your data source:
20
20
21
21
```bash
22
22
export DUCKDB_CSV_DIR=`pwd`/../postgres/test-data
23
-
scripts/load.sh
24
23
```
25
24
26
-
### Running the benchmark
25
+
### Loading the data set
27
26
28
-
To run the scripts of benchmark framework, edit the `driver/{create-validation-parameters,validate,benchmark}.properties` files, then run their script, one of:
27
+
Load the data set as follows:
29
28
30
29
```bash
31
-
driver/create-validation-parameters.sh
32
-
driver/validate.sh
33
-
driver/benchmark.sh
30
+
scripts/load.sh
34
31
```
35
32
33
+
### Running the benchmark driver
34
+
35
+
The instructions below explain how to run the benchmark driver in one of the three modes (create validation parameters, validate, benchmark). For more details on the driver modes, check the ["Driver modes" section of the main README](../README.md#driver-modes).
36
+
37
+
#### Create validation parameters
38
+
39
+
1. Edit the `driver/benchmark.properties` file. Make sure that the `ldbc.snb.interactive.scale_factor`, `ldbc.snb.interactive.updates_dir`, `ldbc.snb.interactive.parameters_dir` properties are set correctly and are in sync.
40
+
41
+
2. Run the script:
42
+
43
+
```bash
44
+
driver/create-validation-parameters.sh
45
+
```
46
+
47
+
#### Validate
48
+
49
+
1. Edit the `driver/validate.properties` file. Make sure that the `validate_database` property points to the file you would like to validate against.
50
+
51
+
2. Run the script:
52
+
53
+
```bash
54
+
driver/validate.sh
55
+
```
56
+
57
+
#### Benchmark
58
+
59
+
1. Edit the `driver/benchmark.properties` file. Make sure that the `ldbc.snb.interactive.scale_factor`, `ldbc.snb.interactive.updates_dir`, and `ldbc.snb.interactive.parameters_dir` properties are set correctly and are in sync.
60
+
61
+
2. Run the script:
62
+
63
+
```bash
64
+
driver/benchmark.sh
65
+
```
66
+
67
+
#### Reload between runs
68
+
36
69
:warning: The default workload contains updates which are persisted in the database. Therefore, **the database needs to be reloaded or restored from backup before each run**. Use the provided `scripts/backup-database.sh` and `scripts/restore-database.sh` scripts to achieve this.
Copy file name to clipboardExpand all lines: graphdb/README.md
+34-9Lines changed: 34 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -31,9 +31,11 @@ An example configuration for scale factor 1 is given in the [`params-ttl.ini`](h
31
31
32
32
> The result of the execution will generate three .ttl files `social_network_activity_0_0.ttl`, `social_network_person_0_0.ttl` and `social_network_static_0_0.ttl`
33
33
34
+
## Running the benchmark
35
+
34
36
### Preprocessing and loading
35
37
36
-
After that you need to change the following environment variables based on your data source.
38
+
Change the following environment variables based on your data source.
37
39
38
40
1. Set the `GRAPHDB_IMPORT_TTL_DIR` environment variable to point to the generated data set. Its default value points to the example data set under the `test-data` directory:
39
41
@@ -66,17 +68,40 @@ scripts/start-graphdb.sh
66
68
> scripts/one-step-load.sh
67
69
>```
68
70
69
-
## Running the benchmark
71
+
### Running the benchmark driver
70
72
71
-
4. To run the scripts of benchmark framework, edit the `driver/{create-validation-parameters,validate,benchmark}.properties` files, then run their script, one of:
73
+
The instructions below explain how to run the benchmark driver in one of the three modes (createvalidationparameters,validate,benchmark). For more details on the driver modes, check the ["Driver modes" section of the main README](../README.md#driver-modes).
72
74
73
-
```bash
74
-
driver/create-validation-parameters.sh
75
-
driver/validate.sh
76
-
driver/benchmark.sh
77
-
```
75
+
#### Create validation parameters
76
+
77
+
1. Edit the `driver/benchmark.properties` file. Make sure that the `ldbc.snb.interactive.scale_factor`, `ldbc.snb.interactive.updates_dir`, `ldbc.snb.interactive.parameters_dir` properties are set correctly and are in sync.
78
+
79
+
2. Run the script:
80
+
81
+
```bash
82
+
driver/create-validation-parameters.sh
83
+
```
84
+
85
+
#### Validate
86
+
87
+
1. Edit the `driver/validate.properties` file. Make sure that the `validate_database` property points to the file you would like to validate against.
88
+
89
+
2. Run the script:
90
+
91
+
```bash
92
+
driver/validate.sh
93
+
```
94
+
95
+
#### Benchmark
96
+
97
+
1. Edit the `driver/benchmark.properties` file. Make sure that the `ldbc.snb.interactive.scale_factor`, `ldbc.snb.interactive.updates_dir`, and `ldbc.snb.interactive.parameters_dir` properties are set correctly and are in sync.
78
98
79
-
:warning:*Note that the default workload contains updates which are persisted in the database. Therefore, the database needs to be re-loaded between steps – otherwise repeated updates would insert duplicate entries.*
99
+
2. Run the script:
80
100
101
+
```bash
102
+
driver/benchmark.sh
103
+
```
81
104
105
+
#### Reload between runs
82
106
107
+
:warning: The default workload contains updates which are persisted in the database. Therefore, **the database needs to be reloaded or restored from backup before each run**. Use the provided `scripts/backup-database.sh` and `scripts/restore-database.sh` scripts to achieve this.
The default configuration of the database (e.g. database name, user, password) is set in the `scripts/vars.sh` file.
@@ -71,7 +73,7 @@ The default configuration of the database (e.g. database name, user, password) i
71
73
72
74
### Running the benchmark driver
73
75
74
-
Run the benchmark driver in one of the three modes (create validation parameters, validate, benchmark).
76
+
The instructions below explain how to run the benchmark driver in one of the three modes (create validation parameters, validate, benchmark). For more details on the driver modes, check the ["Driver modes" section of the main README](../README.md#driver-modes).
75
77
76
78
#### Create validation parameters
77
79
@@ -85,7 +87,7 @@ Run the benchmark driver in one of the three modes (create validation parameters
85
87
86
88
#### Validate
87
89
88
-
1. Edit the `driver/validate.properties` file. Make sure that the `validate_database` property points to the input CSV file.
90
+
1. Edit the `driver/validate.properties` file. Make sure that the `validate_database` property points to the file you would like to validate against.
0 commit comments