Docker
Step 1 - Why Docker?

Docker/containers are important for a few reasons -
- Kubernetes/Container orchestration
- Running processes in isolated environments
- Starting projects/auxiliary services locally
Step 2 - Containerization
What are containers

Containers are a way to package and distribute software applications in a way that makes them easy to deploy and run consistently across different environments. They allow you to package an application, along with all its dependencies and libraries, into a single unit that can be run on any machine with a container runtime, such as Docker.
Why containers

- Everyone has a different operating system
- Steps to run a project can vary based on OS
- It becomes much harder to keep track of dependencies as a project grows
Benefits of using containers

- Let you describe your configuration in a single file
- Can run in isolated environments
- Make local setup of open-source projects a breeze
- Make installing auxiliary services/databases easy
References
- For reference, the following command starts mongo on all operating systems -
docker run -d -p 27017:27017 mongo
- Docker isn’t the only way to create containers
Step 3 - History of Docker

Docker is a YC-backed company; the Docker project was launched in ~2013
They envisioned a world where containers would become mainstream and people would deploy their applications using them
That is mostly true today
Most projects that you open on GitHub will/should have Dockerfiles in them (a way to create docker containers)
Ref - https://www.ycombinator.com/blog/solomon-hykes-docker-dotcloud-interview/
Step 4 - Installing docker
https://docs.docker.com/engine/install/

Make sure you’re able to run the docker cli locally -
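For example, printing the version and running the hello-world image are quick sanity checks (assuming the Docker daemon is running) -
docker --version
docker run hello-world
The hello-world image just prints a confirmation message and exits, which confirms the CLI can reach the daemon.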

Step 5 - Inside docker

As an application/full stack developer, you need to be comfortable with the following terms -
- Docker Engine
- Docker CLI - Command line interface
- Docker registry
1. Docker Engine
Docker Engine is an open-source containerization technology that allows developers to package applications into containers
Containers are standardized executable components combining application source code with the operating system (OS) libraries and dependencies required to run that code in any environment.
2. Docker CLI
The command line interface lets you talk to the Docker Engine and start/stop/list containers
docker run -d -p 27017:27017 mongo
💡 The Docker CLI is not the only way to talk to a docker engine. You can hit the Docker REST API to do the same things
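For example, on Linux (and typically on macOS via Docker Desktop) the engine listens on a unix socket, and you can list running containers straight over the REST API -
curl --unix-socket /var/run/docker.sock http://localhost/containers/json
This returns the same information as docker ps, just as JSON.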
3. Docker registry
The docker registry is how Docker makes money.
It is similar to GitHub, but it lets you push images rather than source code
Docker’s main registry - https://dockerhub.com/
Mongo image on docker registry - https://hub.docker.com/_/mongo
Step 6 - Images vs containers
Docker Image
A Docker image is a lightweight, standalone, executable package that includes everything needed to run a piece of software, including the code, a runtime, libraries, environment variables, and config files.
💡 A good mental model for an image is your codebase on GitHub
Docker Container
A container is a running instance of an image. It encapsulates the application or service and its dependencies, running in an isolated environment.
💡 A good mental model for a container is when you run node index.js on your machine from some source code you got from GitHub

Step 7 - Port mapping
docker run -d -p 27018:27017 mongo
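Here -p 27018:27017 maps port 27018 on your machine to port 27017 inside the container, so mongo is reachable on the host at 27018 even though it listens on its default port inside. A quick way to verify, assuming you have mongosh installed locally -
mongosh "mongodb://localhost:27018"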

Step 8 - Common docker commands
- docker images
- docker ps
- docker run
- docker build
1. docker images
Shows you all the images that you have on your machine
2. docker ps
Shows you all the containers you are running on your machine
3. docker run
Lets you start a container
- -p ⇒ lets you create a port mapping
- -d ⇒ lets you run it in detached mode
4. docker build
Lets you build an image. We will see this after we understand how to create your own Dockerfile
5. docker push
Lets you push your image to a registry
6. Extra commands
- docker kill
- docker exec
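Putting the commands above together, a typical local session might look like this (the container id is whatever docker ps prints) -
docker run -d -p 27017:27017 mongo   # start a container
docker ps                            # note the container id
docker exec <container_id> ls /data  # run a command inside it
docker kill <container_id>           # stop it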
Step 9 - Dockerfile
What is a Dockerfile
If you want to create an image from your own code that you can push to dockerhub, you need to create a Dockerfile for your application.
A Dockerfile is a text document that contains all the commands a user could call on the command line to create an image.
How to write a dockerfile
A dockerfile has 2 parts
- Base image
- Bunch of commands that you run on the base image (to install dependencies like Node.js)
Let’s write our own Dockerfile
Let’s try to containerise this backend app - https://github.com/100xdevs-cohort-2/week-15-live-1

<details>
<summary>Solution</summary>
# Base image with Node.js 20 pre-installed
FROM node:20
# Run all following commands from /app inside the image
WORKDIR /app
# Copy the project into the image
COPY . .
# Install dependencies
RUN npm install
# Generate the prisma client
RUN npx prisma generate
# Build the app (outputs to dist/)
RUN npm run build
# Document the port the app listens on
EXPOSE 3000
# Default command when a container starts from this image
CMD ["node", "dist/index.js"]
</details>
Common commands
- WORKDIR: Sets the working directory for any RUN, CMD, ENTRYPOINT, COPY instructions that follow it.
- RUN: Executes any commands in a new layer on top of the current image and commits the results.
- CMD: Provides defaults for executing a container. There can only be one CMD instruction in a Dockerfile.
- EXPOSE: Informs Docker that the container listens on the specified network ports at runtime.
- ENV: Sets environment variables.
- COPY: Allows files from the Docker host to be added to the Docker image.
Step 10 - Building images
Now that you have a dockerfile in your project, try building a docker image from it
docker build -t image_name .
Now if you try to look at your images, you should notice a new image created
docker images
💡 Add a .dockerignore so that node_modules don’t get copied over
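A minimal .dockerignore for this kind of project might look like -
node_modules
.git
(node_modules gets rebuilt inside the image by npm install anyway, so copying it from the host just slows the build down.)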
Step 11 - Running images
docker run -p 3000:3000 image_name
Try visiting localhost:3000
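Or check from the terminal (assuming the app serves something at the root route) -
curl http://localhost:3000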

Step 12 - Passing in env variables
docker run -p 3000:3000 -e DATABASE_URL="postgres://avnadmin:AVNS_EeDiMIdW-dNT4Ox9l1n@pg-35339ab4-harkirat-d1b9.a.aivencloud.com:25579/defaultdb?sslmode=require" image_name
The -e argument lets you pass environment variables to your Node.js app
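To confirm the variable reached the container, you can print its environment from another terminal (inside the app it would typically be read via process.env.DATABASE_URL) -
docker exec <container_name_or_id> env | grep DATABASE_URL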
Step 13 - More commands
- docker kill - to kill a container
- docker exec - to execute a command inside a container
Examples
- List all contents of a container folder
docker exec <container_name_or_id> ls /path/to/directory
- Running an Interactive Shell
docker exec -it <container_name_or_id> /bin/bash
Step 14 - Pushing to dockerhub
Once you’ve created your image, you can push it to dockerhub to share it with the world.
- Sign up to dockerhub
- Create a new repository
- Log in to the docker cli
  - docker login
  - you might have to create an access token - https://docs.docker.com/security/for-developers/access-tokens/
- Push to the repository
docker push your_username/your_reponame:tagname
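💡 The image name must match your_username/your_reponame for the push to land in your repository. If you built it under a different name, re-tag it first -
docker tag image_name your_username/your_reponame:tagname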
Step 15 - Layers in Docker
In Docker, layers are a fundamental part of the image architecture that allows Docker to be efficient, fast, and portable. A Docker image is essentially built up from a series of layers, each representing a set of differences from the previous layer.

How layers are made -
- Base Layer: The starting point of an image, typically an operating system (OS) like Ubuntu, Alpine, or any other base image specified in a Dockerfile.
- Instruction Layers: Each command in a Dockerfile creates a new layer in the image. These include instructions like RUN and COPY, which modify the filesystem by installing packages, copying files from the host to the container, or making other changes. Each of these modifications creates a new layer on top of the base layer.
- Reusable & Shareable: Layers are cached and reusable across different images, which makes building and sharing images more efficient. If multiple images are built from the same base image or share common instructions, they can reuse the same layers, reducing storage space and speeding up image downloads and builds.
- Immutable: Once a layer is created, it cannot be changed. If a change is made, Docker creates a new layer that captures the difference. This immutability is key to Docker's reliability and performance, as unchanged layers can be shared across images and containers.
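You can see these layers for any image you have locally with docker history, which lists each layer along with the instruction that created it -
docker history mongo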
Step 16 - Layers practically
For a simple Node.js app - https://github.com/100xdevs-cohort-2/week-15-live-2
Dockerfile

Logs

Observations -
- Base image creates the first layer
- Each RUN, COPY, WORKDIR command creates a new layer
- Layers can get re-used across docker builds (notice CACHED in 1/6)
Step 17 - Why layers?
If you change your Dockerfile, layers can get re-used based on where the change was made
💡 If a layer changes, all subsequent layers also change
Case 1 - You change your source code

<details>
<summary>Logs</summary>

</details>
Case 2 - You change the package.json file (added a dependency)

<details>
<summary>Logs</summary>

</details>
Thought experiment
How often in a project do you think dependencies change?
How often does the npm install layer need to change?
Wouldn’t it be nice if we could cache the npm install step considering dependencies don’t change often?
Step 18 - Optimising Dockerfile
What if we change the Dockerfile a bit -

<details>
<summary>Dockerfile</summary>
FROM node:20
WORKDIR /usr/src/app

# Copy only the files that npm install and npx prisma generate need
COPY package* .
COPY ./prisma ./prisma

RUN npm install
RUN npx prisma generate

# Only now copy the rest of the source code and build it
COPY . .
RUN npm run build

EXPOSE 3000
CMD ["node", "dist/index.js"]
</details>
- We first copy over only the things that npm install and npx prisma generate need
- Then we run these scripts
- Then we copy over the rest of the source code and build it
Case 1 - You change your source code (but nothing in package.json/prisma)

Case 2 - You change the package.json file (added a dependency)

Step 19 - Networks and volumes
Networks and volumes are concepts that become important when you have multiple containers running and you
- Need to persist data across docker restarts
- Need to allow containers to talk to each other

💡 We didn’t need networks until now because when we started the mongo container, it was being accessed by a Node.js process running directly on the machine
Step 20 - Volumes
If you kill a mongo docker container and start a new one, you will notice that your data is gone.
This is because docker containers are transitory (each docker run starts from the image’s filesystem, so data written inside the old container doesn’t carry over)
Without volumes
- Start a mongo container locally
docker run -p 27017:27017 -d mongo
- Open it in MongoDB Compass and add some data to it


- Kill the container
docker kill <container_id>
- Restart the container
docker run -p 27017:27017 -d mongo
- Try to explore the database in Compass and check if the data has persisted (it won’t have)
With volumes
- Create a volume
docker volume create volume_database
- Mount the folder in mongo which actually stores the data (/data/db) to this volume
docker run -v volume_database:/data/db -p 27017:27017 mongo
- Open it in MongoDB Compass and add some data to it


- Kill the container
docker kill <container_id>
- Restart the container
docker run -v volume_database:/data/db -p 27017:27017 mongo
- Try to explore the database in Compass and check if the data has persisted (it will!)
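You can also list your volumes and inspect where Docker keeps the data on disk -
docker volume ls
docker volume inspect volume_database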

Step 21 - Network
In Docker, a network is a powerful feature that allows containers to communicate with each other and with the outside world.
Docker containers can’t talk to each other by default.
localhost inside a docker container refers to the container’s own network, not the network of the host machine

How to make containers talk to each other?
Attach them to the same network
- Clone the repo - https://github.com/100xdevs-cohort-2/week-15-live-2.2
- Build the image
docker build -t image_tag .
- Create a network
docker network create my_custom_network
- Start the backend process with the network attached to it
docker run -d -p 3000:3000 --name backend --network my_custom_network image_tag
- Start mongo on the same network
docker run -d -v volume_database:/data/db --name mongo --network my_custom_network -p 27017:27017 mongo
- Check the logs to ensure the db connection is successful
docker logs <container_id>
- Try to visit an endpoint and ensure you are able to talk to the database
- If you want, you can remove the port mapping for mongo since you don’t necessarily need it exposed on your machine
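To double-check that both containers are attached, inspect the network (its Containers section should list backend and mongo). The backend can then reach mongo using the container name as hostname, e.g. a connection string like mongodb://mongo:27017 (assuming that’s what the repo reads) -
docker network inspect my_custom_network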

Types of networks
- Bridge: The default network driver for containers. When you run a container without specifying a network, it's attached to a bridge network. It provides a private internal network on the host machine, and containers on the same bridge network can communicate with each other.
- Host: Removes network isolation between the container and the Docker host, and uses the host's networking directly. This is useful for services that need to handle lots of traffic or need to expose many ports.
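For example, with the host driver (Linux only) the container shares the host’s network stack directly, so no -p mapping is needed -
docker run -d --network host mongo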
Step 22 - docker-compose
Docker Compose is a tool designed to help you define and run multi-container Docker applications. With Compose, you use a YAML file to configure your application's services, networks, and volumes. Then, with a single command, you can create and start all the services from your configuration.

Before docker-compose
- Create a network
docker network create my_custom_network
- Create a volume
docker volume create volume_database
- Start mongo container
docker run -d -v volume_database:/data/db --name mongo --network my_custom_network mongo
- Start backend container
docker run -d -p 3000:3000 --name backend --network my_custom_network backend
After docker-compose
- Install docker-compose - https://docs.docker.com/compose/install/
- Create a yaml file describing all your containers and volumes (by default, all containers in a docker-compose file run on the same network)

<details>
<summary>Solution</summary>
version: '3.8'
services:
  mongodb:
    image: mongo
    container_name: mongodb
    ports:
      - "27017:27017"
    volumes:
      - mongodb_data:/data/db
  backend22:
    image: backend
    container_name: backend_app
    depends_on:
      - mongodb
    ports:
      - "3000:3000"
    environment:
      MONGO_URL: "mongodb://mongodb:27017"
volumes:
  mongodb_data:
</details>
- Start the compose
docker-compose up
- Stop everything (including volumes)
docker-compose down --volumes
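A few other compose commands that come in handy day to day (service name as defined in the yaml above) -
docker-compose up -d          # start everything in detached mode
docker-compose ps             # list the compose-managed containers
docker-compose logs backend22 # view a single service’s logs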
