added articles

emptymalei · emptymalei · commit d041f2602e10 · 2020-03-10T14:30:19.000+01:00
diff --git a/_articles/covid-eu-cases.md b/_articles/covid-eu-cases.md
@@ -10,13 +10,120 @@ comments: true
 types:
 - 'dataset'
 category:
-- epidemic
+- github
 tag:
-- 'Epidemic'
-summary: Automated data collection using GitHub Actions.
+- 'GitHub Actions'
+summary: covid19-eu-zh/covid19-eu-data is an automated COVID-19 confirmed cases data collection experiment using GitHub Actions.
 dataset:
   - id: covid19_eu_data
 references:
   - name: "Data Mining: Concepts and Techniques"
     link: https://www.amazon.com/Data-Mining-Concepts-Techniques-Management/dp/0123814790
 ---
+
+The [covid19-eu-zh/covid19-eu-data](https://github.com/covid19-eu-zh/covid19-eu-data) repository is an experiment of data scraping and aggregation using GitHub Actions.
+
+## Structure
+
+The structure of the project is as follows.
+
+```
+.
+├── README.md
+├── dataset     #where the data files lives
+├── documents   #where the raw data and files lives
+├── now.json    #zeit now setup for a FAAS service
+└── scripts     #scripts to download and aggregate data
+```
+
+### Scripts
+
+We have a python script for each country for more flexible schedules of each country. We are using classes from `utils.py` so that the scripts all have similar structure.
+
+```
+scripts
+├── download_at.py
+├── download_de.py
+├── download_es.py
+├── download_fr.py
+├── download_nl.py
+├── download_uk.py
+├── requirements.txt
+└── utils.py
+```
+
+### Dataset
+
+The dataset folder contains the full dataset of each country and the daily pdates of each country.
+
+```
+dataset
+├── covid-19-at.csv
+├── covid-19-de.csv
+├── covid-19-nl.csv
+├── covid-19-uk.csv
+└── daily
+    ├── at
+    ├── de
+    ├── nl
+    └── uk
+```
+
+## GitHub Actions
+
+We manage the pipelines using GitHub Actions. The full set of workflows is found in [the original repository](https://github.com/covid19-eu-zh/covid19-eu-data/actions).
+
+We use Germany as an example. In the workflow for Germany, we have two trigger, pushing to master branch and schedule. The job steps are
+
+1. Checkout the repository;
+2. Setup python and install python requirements;
+3. Run the python script to download and aggregate data;
+4. Push data to repository.
+
+{% highlight yaml %}
+name: CI Download DE SARS-COV-2 Cases from RKI
+
+on:
+  push:
+    branches:
+      - master
+  schedule:
+    - cron:  '0 7/1 * * *'
+
+jobs:
+  build:
+
+    runs-on: ubuntu-latest
+
+    steps:
+      - name: Checkout current repo
+        uses: actions/checkout@v2
+      - name: Get current directory and files
+        run: |
+          pwd
+          ls
+      - uses: actions/setup-python@v1
+        with:
+          python-version: '3.7' # Version range or exact version of a Python version to use, using SemVer's version range syntax
+          architecture: 'x64' # optional x64 or x86. Defaults to x64 if not specified
+      - name: Install Python Requirements
+        run: |
+          python --version
+          pip install -r scripts/requirements.txt
+      - name: Download Records
+        run: |
+          python scripts/download_de.py
+          ls dataset/daily/de
+          git config --local user.email "action@github.com"
+          git config --local user.name "GitHub Action"
+          git pull
+          git status
+          git add .
+          git commit -m "Update DE Dataset" || echo "Nothing to update"
+          git status
+      - name: Push changes
+        uses: ad-m/github-push-action@master
+        with:
+          repository: covid19-eu-zh/covid19-eu-data
+          github_token: {% raw  %}${{ secrets.GITHUB_TOKEN }}{% endraw  %}
+{% endhighlight %}
diff --git a/_layouts/articles.html b/_layouts/articles.html
@@ -36,7 +36,7 @@ <h1 class="post-title has-text-centered is-size-1" itemprop="name headline">{{ p
   <div class="is-divider" data-content="AUTHORS"></div>
     {% for author in page.authors %}
       {% assign author_db = site.data.authors[author.id] %}
-          <div class="box">
+          <div class="box is-size-7">
             <article class="media">
               <div class="media-content">
                 <div class="content">
diff --git a/about.md b/about.md
@@ -23,6 +23,7 @@ DataHerb do **not** take your data. The datasets are fully managed by the owners
 DataHerb is an initiative for transparent data management in open data. To achieve transparency, we use a metadata-driven design. Every step is transparent and can be investigated.
 
 - Contribute datasets: list your datasets on DataHerb in just two steps. Datasets that can be used to enhance machine learning datasets are preferred. [Tutorial]({{ site.baseurl }}/add)
+- Write a short story to tell us about the story behind your dataset and submit to [DataHerb Articles]({{ site.baseurl }}/articles).
 - Use DataHerb in your projects.
 - Spread the words.
 - Help us build a better DataHerb. [GitHub Organization](https://github.com/dataherb); [Leave a comment](#comments)
diff --git a/community/dataherb-python.md b/community/dataherb-python.md
diff --git a/community/index.md b/community/index.md
@@ -6,3 +6,5 @@ comments: true
 ---
 
 DataHerb is also a community for data sharing.
+
+Join our telegram channel: [DataHerb Telegram Channel](https://t.me/dataherb).