Skip to content

Commit 9259c68

Browse files
committed
Readme
1 parent d465e09 commit 9259c68

File tree

1 file changed

+1
-3
lines changed

1 file changed

+1
-3
lines changed

README.md

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -63,14 +63,12 @@ If you are interested in contributing to the project, please make sure to open a
6363
To support a new data source, the first step is to think a Python object that can represent one file. For instance, a CSV file can be represented as a `pandas.Dataframe`. Then, the following functions need to be expanded. Please read the documentation of each function before writing changes.
6464

6565
* `_process_source()`. The function must return three Python objects, each one with the following information.
66+
- The set of current data, which represents the data from the current version of the data source.
6667
- The set of new data, which represents the data present in the data source but not in the snapshot.
6768
- The set of removed data, which represents the data present in the snapshot but not in the data source.
68-
- The set of old data, which represents the data from the snapshot.
6969

7070
* `_save_data_to_file()`. This function must serialize the Python object used to represent each source.
7171

72-
* `_calculate_new_snapshot_df()`. This function calculates the new snapshot data by adding the set of new data and subtracting the set of removed data. These sets of data are the ones returned by `_process_source()`.
73-
7472
### Supporting new mapping engines
7573

7674
In order to support a new mapping engine, the function `_materialize_set()` must be expanded. The function should return a `rdflib.Graph` containing the generated triples. Note that if the new mapping engine is not written in python, it could be possible to run a script with `subprocess.run` and then read the output triples with `rdflib`.

0 commit comments

Comments
 (0)