Metadata-Version: 2.1
Name: babish-db
Version: 0.0.1
Summary: Data with Babish DB
Home-page: https://github.com/jklewa/data-with-babish
Author: jklewa
Author-email: jklewa@gmail.com
License: GPLv3+
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: GNU General Public License v3 or later (GPLv3+)
Classifier: Operating System :: OS Independent
Requires-Python: >=3.6
Description-Content-Type: text/markdown
Requires-Dist: beautifulsoup4
Requires-Dist: click
Requires-Dist: flask
Requires-Dist: flask-migrate
Requires-Dist: flask-sqlalchemy
Requires-Dist: furl
Requires-Dist: html5lib
Requires-Dist: measurement
Requires-Dist: psycopg2-binary
Requires-Dist: pyfunctional
Requires-Dist: requests

# _Data with Babish_

This project aims to analyze the recipes of the popular YouTube channel [Binging with Babish](http://bingingwithbabish.com) and convert them into beautiful data.

## Visualizations

* [jklewa.github.io/data-with-babish-example](https://jklewa.github.io/data-with-babish-example/) ([repo](https://github.com/jklewa/data-with-babish-example)) An interactive episode data viewer

## Datasets

 * [ibdb.episodes.json](datasets/ibdb.episodes.json) ([docs](#ibdbepisodesjson)) Episodes and related guests, recipes and inspiration
 * [ibdb.guests.json](datasets/ibdb.guests.json) ([docs](#ibdbguestsjson)) Guests and their appearances
 * [ibdb.recipes.json](datasets/ibdb.recipes.json) ([docs](#ibdbrecipesjson)) Recipes and their origin episode
 * [ibdb.references.json](datasets/ibdb.references.json) ([docs](#ibdbreferencesjson)) TV Shows, Movies, etc. and when they were referenced
 * [ibdb.shows.json](datasets/ibdb.shows.json) ([docs](#ibdbshowsjson)) Babish's Shows and their episode lists
 * [babish.json](datasets/babish.json) ([docs](#babishjson)) Parsed recipe ingredients, grouped by episode (Deprecated)

### [ibdb.episodes.json](datasets/ibdb.episodes.json)

  **Episodes** and related guests, recipes and inspiration in the format:

  ```python
  [
    {
      "episode_id": "epid",
      "name": "Episode Name",
      "published_date": "YYYY-MM-DD",
      "youtube_link": "https://www.youtube.com/watch?v=...",
      "official_link": "https://www.bingingwithbabish.com/recipes/...",
      "image_link": "https://preview.image.host/image.png",
      "related": {
        "show": {
          "show_id": 1,
          "name": "Binging with Babish"
        },
        "guests": [
          {
            "guest_id": 1,
            "name": "Guest Name"
          }
        ],
        "inspired_by": [
          {
            "reference_id": 1,
            "type": "tv_show|movie|youtube_channel|video_game|other",
            "name": "Reference Name",
            "description": "A description of the reference.",
            "external_link": "https://link.to.more/"
          }
        ],
        "recipes": [
          {
            "recipe_id": 1,
            "name": "Recipe Name",
            "raw_ingredient_list": "Ingedient 1\nIngredient 2\n...",
            "raw_procedure": "Step 1.\nStep 2.\n...",
            "ingredient_list": [
              [
                1.0,                   # quantity
                "tablespoon",          # unit
                "Butter",              # name
                "1 tablespoon butter"  # raw text from recipe
              ],
              # ...
            ]
          }
        ]
      }
    },
    # ...
  ]
  ```

### [ibdb.guests.json](datasets/ibdb.guests.json)

  **Guests** and their appearances in the format:

  ```python
  [
    {
      "guest_id": 1,
      "name": "Guest Name",
      "appearances": [
        {
          "episode_id": "epid",
          "name": "Episode Name",
          "published_date": "YYYY-MM-DD",
          "youtube_link": "https://www.youtube.com/watch?v=...",
          "official_link": "https://www.bingingwithbabish.com/recipes/...",
          "image_link": "https://preview.image.host/image.png"
        }
      ]
    },
    # ...
  ]
  ```

### [ibdb.recipes.json](datasets/ibdb.recipes.json)

  **Recipes** and their origin episode in the format:

  ```python
  [
    {
      "recipe_id": 1,
      "name": "Recipe Name",
      "raw_ingredient_list": "Ingedient 1\nIngredient 2\n...",
      "raw_procedure": "Step 1.\nStep 2.\n...",
      "source": {
        "episode_id": "epid",
        "name": "Episode Name",
        "published_date": "YYYY-MM-DD",
        "youtube_link": "https://www.youtube.com/watch?v=...",
        "official_link": "https://www.bingingwithbabish.com/recipes/...",
        "image_link": "https://preview.image.host/image.png"
      },
      "ingredient_list": [
        [
          1.0,                   # quantity
          "tablespoon",          # unit
          "Butter",              # name
          "1 tablespoon butter"  # raw text from recipe
        ],
        # ...
      ]
    },
    # ...
  ]
  ```

### [ibdb.references.json](datasets/ibdb.references.json)

  TV Shows, Movies, etc. **References** and when they were referenced in the format:

  ```python
  [
    {
      "reference_id": 1,
      "type": "tv_show|movie|youtube_channel|video_game|other",
      "name": "Reference Name",
      "description": "A description of the reference.",
      "external_link": "https://link.to.more/",
      "episodes_inspired": [
        {
          "episode_id": "epid",
          "name": "Episode Name",
          "published_date": "YYYY-MM-DD",
          "youtube_link": "https://www.youtube.com/watch?v=...",
          "official_link": "https://www.bingingwithbabish.com/recipes/...",
          "image_link": "https://preview.image.host/image.png"
        }
      ]
    },
    # ...
  ]
  ```

### [ibdb.shows.json](datasets/ibdb.shows.json)

  **Babish's Shows** and their episode lists in the format:

  ```python
  [
    {
      "show_id": 1,
      "name": "Binging with Babish",
      "episodes": [
        {
          "episode_id": "epid",
          "name": "Episode Name",
          "published_date": "YYYY-MM-DD",
          "youtube_link": "https://www.youtube.com/watch?v=...",
          "official_link": "https://www.bingingwithbabish.com/recipes/...",
          "image_link": "https://preview.image.host/image.png"
        }
      ]
    },
    # ...
  ]
  ```

### [babish.json](datasets/babish.json)

  **(Deprecated)**

  Contains ingredients from [BWB Recipes](http://bingingwithbabish.com/recipes) in the format:

  ```python
  [
    {
      "episode_name": "Episode Name",
      "episode_link": "https://www.bingingwithbabish.com/recipes/...",
      "youtube_link": "https://www.youtube.com/watch?v=...",
      "published": "YYYY-MM-DD",
      "recipes": [
        {
          "method": "Method Name (from Episode Name)",
          "ingredients": [
            [
              1.0,                   # quantity
              "tablespoon",          # unit
              "Butter",              # name
              "1 tablespoon butter"  # raw text from recipe
            ],
            # ...
          ]
        },
        # ...
      ]
    },
    # ...
  ]
  ```

### Resources

* Regex Samples: https://regexr.com/3p7h8 https://regexr.com/3p6pq
* Handling Unicode Fractions: https://stackoverflow.com/questions/1263796/how-do-i-convert-unicode-characters-to-floats-in-python

## Contributing

### Local Development
Required tools: **Docker, Docker Compose**

1. Build `docker-compose build`
2. Run DB and API `docker-compose up -d`
3. Browse http://localhost:5000/
4. Update DB and datasets `docker-compose exec ibdb sync update export`
5. See other commands `docker-compose exec ibdb --help`

This will use [populate_db.py](ibdb/populate_db.py) to scrape and upsert episodes into the DB and [export.py](ibdb/export.py) to generate new [datasets/](datasets) from the DB's contents.

You can also explore the original [populate_babish_json.py](ibdb/populate_babish_json.py) and [Jupyter Notebooks](http://ipython.org/notebook.html)
1. `cd notebooks/`
2. Start Jupyter on [http://localhost:8888](http://localhost:8888) `jupyter notebook`
3. Open `Babish Recipe Extract.ipynb` or `Babish Data Analysis.ipynb`

**NOTE:** Be aware that `Babish Recipe Extract.ipynb` will make **LOTS** of network calls to the official bingingwithbabish.com website. Calls are cached and rate limited but please be very considerate and only run them if absolutely necessary.

### Tests
Tests covering [recipe_parser.py](ibdb/recipe_parser.py) are located in the [tests/](tests/) directory and can be run using [pytest](https://docs.pytest.org/en/latest/).

Required tools: **Python 3.8**

1. Install packages `pip install -r requirements.txt`
2. Run tests `python -m pytest`

### Docs
* [Code of Conduct](CODE_OF_CONDUCT.md), [GNU General Public License v3.0](LICENSE)


