Deployment

## Intro to deployment

### Architecturing (web) applications

![api](static/img/webapp-architecture.jpg)

### Communicating between applications

[Application Programming Interface](https://glossary.cncf.io/application-programming-interface/)

> Une API (application programming interface ou « interface de programmation d’application ») est une interface logicielle qui permet de « connecter » un logiciel ou un service à un autre logiciel ou service afin d’échanger des données et des fonctionnalités.

### API

![api](static/img/api-infographic.jpg)

### REST API

Representational state transfer (REST)

![rest](static/img/restful-api.png)

### Microservices vs "monoliths"

![microservices](static/img/microservices-architecture.png)

PS: [Microservices are hard](https://dwmkerr.com/the-death-of-microservice-madness-in-2018/)

### Multi applications & docker

![docker-compose](static/img/docker-compose-diagram.png)

### How does it relate to me ?

### Hands-On

- **Backend** How to expose an ML model to a community of users through a web app
- **Frontend** How to build a companion app to interact with your model in an ergonomic fashion
- **Deployment** How to deploy both applications on a single instance

In the data science workflow,

![](static/img/ml-workflow.png)

I have an **awesome** ML model

![model](static/img/model-deployment-meme-1.jpg)

(Just kidding)

![model](static/img/model-deployment-meme-2.jpg)

I want to deploy it on the cloud for other to use

![model](static/img/mistral-model-serving.png)

Today we will do it by hand

![deploy](static/img/model-deployment-options.png)

Other methods for ML Model packaging behind a web server

![package](static/img/model-packaging.png)

- cog : https://github.com/replicate/cog
- pesto : https://github.com/AirbusDefenceAndSpace/pesto
- litserve : https://github.com/Lightning-AI/LitServe

Interaction with user ? We use CURL 👎

```bash
curl -X POST "http://my-instance/predict" \
    -H  "accept: application/json" \
    -H  "Content-Type: application/json" \
    -d "{\"model\":\"string\",\"image\":\"...\"}"
```

Interaction with user ? We use CURL 👎

![json](static/img/json-request-response.png)

Interaction with users ? 👍

![results](static/img/api-results-example.png)

Webapp builder for data scientists

![streamlit](static/img/streamlit-app.png)

[you've seen it before](http://supaerodatascience.github.io/DE/1_4_be.html#6-lets-discover-streamlit)

Webapp builder for data scientists

- [streamlit](https://streamlit.io/)
- [gradio](https://gradio.app/)

Let's build it !

- A model behind a Restful API, packaged in a docker
- A frontend using streamlit, packaged in a docker
- Deploy it on Google Cloud Platform using GCE & docker-compose
- Send it to your friends !

In reality, it's much more complex...

![](static/img/model-serving-complex.png)

How to scale deployment ?

- [CS229 - Class by Chip Huyen](https://docs.google.com/presentation/d/1U_zKs19VLJKnGE02JDRnzxJ8lgeVF22WSZ_GrA646fY/edit#slide=id.p)
- [CS229 - Deployment with Ray Serve](https://github.com/anyscale/academy/blob/main/ray-serve/e2e/tutorial.ipynb)
- https://docs.ray.io/en/latest/serve/develop-and-deploy.html

Some links

- https://github.com/EthicalML/awesome-production-machine-learning
- Machine Learning System Designs https://stanford-cs329s.github.io/