remykarem
diff --git a/‎.gitignore
Lines changed: 4 additions & 1 deletion b/‎.gitignore
Lines changed: 4 additions & 1 deletion
diff --git a/‎MANIFEST.in
Lines changed: 1 addition & 0 deletions b/‎MANIFEST.in
Lines changed: 1 addition & 0 deletions
diff --git a/‎README.md
Lines changed: 27 additions & 15 deletions b/‎README.md
Lines changed: 27 additions & 15 deletions
diff --git a/‎p2j/__init__.py b/‎p2j/__init__.py
diff --git a/‎examples/example.py renamed to ‎p2j/examples/example.py b/‎examples/example.py renamed to ‎p2j/examples/example.py
diff --git a/‎examples/example2.ipynb renamed to ‎p2j/examples/example2.ipynb b/‎examples/example2.ipynb renamed to ‎p2j/examples/example2.ipynb
diff --git a/‎examples/example2.py renamed to ‎p2j/examples/example2.py b/‎examples/example2.py renamed to ‎p2j/examples/example2.py
diff --git a/‎examples/example3.ipynb renamed to ‎p2j/examples/example3.ipynb b/‎examples/example3.ipynb renamed to ‎p2j/examples/example3.ipynb
diff --git a/‎examples/example3.py renamed to ‎p2j/examples/example3.py b/‎examples/example3.py renamed to ‎p2j/examples/example3.py
diff --git a/‎p2j/p2j.py
Lines changed: 162 additions & 0 deletions b/‎p2j/p2j.py
Lines changed: 162 additions & 0 deletions
diff --git a/‎templates/cell_code.json renamed to ‎p2j/templates/cell_code.json b/‎templates/cell_code.json renamed to ‎p2j/templates/cell_code.json
diff --git a/‎templates/cell_markdown.json renamed to ‎p2j/templates/cell_markdown.json b/‎templates/cell_markdown.json renamed to ‎p2j/templates/cell_markdown.json
diff --git a/‎templates/metadata.json renamed to ‎p2j/templates/metadata.json b/‎templates/metadata.json renamed to ‎p2j/templates/metadata.json
@@ -1,2 +1,5 @@
 .DS_Store
-.ipynb_checkpoints
+.ipynb_checkpoints
+build
+p2j.egg-info
+dist
@@ -0,0 +1 @@
+include README.md LICENSE p2j/templates/*.json
@@ -1,29 +1,31 @@
-# py2nb (Beta)
+# p2j - Python to Jupyter Notebook
 
 Converts Python source code to Jupyter notebook.
 
-The purpose of this repo is so that we can run a code paragraph-by-paragraph and don't have to do that by copying each paragraph of the code into every cell. It's also useful if we want to run our code in Google Colab.
+The purpose of this package is so that we can run a code paragraph-by-paragraph and don't have to do that by copying each paragraph of the code into every cell. It's also useful if we want to run our code in Google Colab.
+
+In a nutshell, every paragraph of your code is transformed into a code cell.
 
 This parser isn't perfect, but you would be satisfactorily pleased with what you get.
 
 ## Installing
 
 ```bash
-git clone https://github.com/raibosome/code2notebook.git
+pip install p2j
 ```
 
 ## Running
 
 ```bash
-python py2nb.py code_to_parse.py
+p2j code_to_parse.py
 ```
 
-and you will get a `code_to_parse.ipynb` Jupyter notebook. See `python py2nb.py -h` for other arguments.
+and you will get a `code_to_parse.ipynb` Jupyter notebook. See `p2j -h` for other arguments.
 
 The `examples/example.py` is a Keras tutorial on building an autoencoder for the MNIST dataset, found [here](https://github.com/keras-team/keras/blob/master/examples/mnist_denoising_autoencoder.py). You can run the example:
 
 ```bash
-python py2nb.py examples/example.py
+p2j examples/example.py
 ```
 
 ## Tests
@@ -41,20 +43,30 @@ Jupyter notebooks are just JSON files. The `py2nb.py` reads the source code line
 ## Project Structure
 
 ```txt
-├── example.py              Example code
-├── py2nb.py                The code that parses and generates the notebook
-└── templates               JSON files that make up the final Jupyter notebook
-    ├── cell_code.json
-    ├── cell_markdown.json
-    └── metadata.json
+├── p2j             The parser module
+│   ├── __init__.py 
+│   ├── examples    Example codes that you can parse
+│   ├── p2j.py      Main file
+│   └── templates   JSON files needed to build the notebook
+├── README.md       This file
+├── LICENSE         Licensing
+├── MANIFEST.in     Python packaging-related
+├── build           Python packaging-related
+├── dist            Python packaging-related
+├── p2j.egg-info    Python packaging-related
+└── setup.py        Python packaging-related
 ```
 
 ## Code format
 
-The parser assumes a format where your code is paragraphed. Each paragraph has the comments part and/or the code part. The comments will be automatically converted to a markdown cell while the code will be, you guessed it, the code cell.
+There is no specific format that you should follow, but generally the parser assumes a format where your code is paragraphed. Each paragraph has the comments part and/or the code part. The comments will be automatically converted to a markdown cell while the code will be, you guessed it, the code cell.
 
-Some examples of well-documented code:
+Some examples of well-documented code (and from which you can test!):
 
 - [PyTorch Tutorials](https://pytorch.org/tutorials/beginner/pytorch_with_examples.html)
 - [Keras Examples](https://github.com/keras-team/keras/tree/master/examples)
-- [Scikit Learn Example](https://scikit-learn.org/stable/auto_examples/classification/plot_digits_classification.html#sphx-glr-auto-examples-classification-plot-digits-classification-py)
+- [Scikit Learn Example](https://scikit-learn.org/stable/auto_examples/classification/plot_digits_classification.html#sphx-glr-auto-examples-classification-plot-digits-classification-py)
+
+## Pull requests
+
+Pull requests are very much encouraged!
@@ -0,0 +1,162 @@
+"""
+This code translate .py files to .ipynb
+"""
+# Standard imports for file handling and JSON files
+import argparse
+import os
+import json
+import sys
+
+# Reserved Python keywords
+RESERVED = ['for', 'with', 'class', 'while']
+HERE = os.path.abspath(os.path.dirname(__file__))
+
+def main():
+
+    # Get source and target filenames
+    parser = argparse.ArgumentParser(description='Parse a file.')
+    parser.add_argument('source_filename', help='File to parse')
+    parser.add_argument('-t', '--target_filename', help='Target filename')
+    parser.add_argument('-o', '--overwrite', action='store_true',
+                        help='Flag to overwrite existing target file')
+    args = parser.parse_args()
+    source_filename = args.source_filename
+    target_filename = args.target_filename
+    overwrite = args.overwrite
+
+    # Check if source file exists and read
+    try:
+        with open(source_filename, 'r') as file:
+            data = [l.rstrip('\n') for l in file]
+    except FileNotFoundError:
+        print("Source file not found. Specify a valid source file.")
+        sys.exit(1)
+
+    # Check if target file is specified and exists. If not specified, create
+    if target_filename is None:
+        target_filename = os.path.splitext(source_filename)[0] + ".ipynb"
+        print("Target file not specified. Creating a default notebook with name {}.".format(
+            target_filename))
+    if not overwrite and os.path.isfile(target_filename):
+        print("File {} exists. Add -o flag to overwrite or specify a different name.".format(target_filename))
+        sys.exit(1)
+
+    # Read JSON files for .ipynb template
+    with open(HERE + '/templates/cell_code.json') as file:
+        code = json.load(file)
+    with open(HERE + '/templates/cell_markdown.json') as file:
+        markdown = json.load(file)
+    with open(HERE + '/templates/metadata.json') as file:
+        misc = json.load(file)
+
+    # Initialise variables
+    final = {}
+    cells = []
+    arr = []
+    num_lines = len(data)
+
+    # Initialise variables for checks
+    is_block_comment = False
+    end_paragraph = True
+    is_running_comment = False
+    is_running_code = False
+    next_is_code = False
+    next_is_nothing = False
+    next_is_comment = False
+    is_running_function = False
+    next_is_function = False
+
+    # Read source code line by line
+    for i, line in enumerate(data):
+
+        buffer = ""
+
+        # Check next line
+        try:
+            next_is_code = data[i+1][0] != "#"
+        except:
+            pass
+        try:
+            next_is_comment = data[i+1][0] == "#"
+        except:
+            pass
+        try:
+            next_is_nothing = data[i+1] == ""
+        except:
+            pass
+        try:
+            next_is_function = data[i+1][:4] == "    " or (
+                data[i+1] == "" and data[i+2][:4] == "    ")
+            # print(line)
+            # print(data[i+1][:4] == "")
+        except:
+            pass
+        end_of_code = i == num_lines-1
+
+        # Skip if line is empty
+        if line == "":
+            continue
+
+        # Sub-paragraph is a comment but not a running code
+        if (is_running_comment or (line[0] == "#" and (line[:8] != "# pylint" or line[:7] != "#pylint")) or line[:3] == "'''" or line[-3:] == "'''" or line[:3] == "\"\"\"" or line[-3:] == "\"\"\"") and not is_running_code:
+
+            if line[:3] == "'''" or line[-3:] == "'''" or line[:3] == "\"\"\"" or line[-3:] == "\"\"\"":
+                is_block_comment = not is_block_comment
+
+            if is_block_comment:
+                buffer = line.replace("'''", "").replace("\"\"\"", "")
+            else:
+                buffer = line[2:]
+
+            # Wrap this sub-paragraph as a cell
+            # if next line is code or next line is space or end of code
+            if end_of_code or (next_is_code and not is_block_comment) or (next_is_nothing and not is_block_comment):
+                arr.append(f"{buffer}")
+                markdown["source"] = arr
+                cells.append(dict(markdown))
+                arr = []
+                is_running_comment = False
+            else:
+                buffer = buffer + "<br>"
+                arr.append(f"{buffer}")
+                is_running_comment = True
+                continue
+        else:  # Sub-paragraph is a comment but not a running code
+            buffer = line
+
+            # Close this if next line is end of code or next is nothing
+            # Don't close if next is still part of a
+            # or not next_is_function) or (not next_is_function and next_is_nothing):
+            if (end_of_code or next_is_nothing) and not (next_is_nothing and next_is_function):
+                arr.append(f"{buffer}")
+                code["source"] = arr
+                cells.append(dict(code))
+                arr = []
+                is_running_code = False
+            else:
+                buffer = buffer + "\n"
+
+                # Put another newline character if in a function
+                try:
+                    if data[i+1] == "" and (data[i+2][:5] == "    #" or data[i+2][:9] == "        #"):
+                        buffer = buffer + "\n"
+                except:
+                    pass
+
+                arr.append(f"{buffer}")
+                is_running_code = True
+                continue
+
+    # Finalise the contents of notebook
+    final["cells"] = cells
+    final.update(misc)
+
+    # Write JSON to target file
+    with open(target_filename, 'w') as outfile:
+        json.dump(final, outfile)
+        print("Notebook {} written.".format(target_filename))
+
+
+if __name__ == "__main__":
+    print("Convert a Python script to Jupyter notebook")
+    main()
Original file line number	Diff line number	Diff line change
`@@ -0,0 +1 @@`
	`1`	`+include README.md LICENSE p2j/templates/*.json`