Make Your Own Artificial Intelligence Driven Web App

The app will capture images and then apply artificial intelligence to identify objects in the image.

Try it running on Firebase

Make your own with REPLIT.com

Follow the instructions below to build your own!

Requirements:

gmail account
webcam
internet connection

Discover the Requirements

What are we trying to build, and what should our app be able to do? Before we can build something, we should have a goal in mind.

For this project, we can start with a high level, abstract goal of "Create an app that identifies objects," but a complicated goal like that should be broken it into smaller, more specific requirements.

Capture images from the camera
Identify the objects in the image
Label the images in the image
Make the app available for anyone to use

It would be very difficult to understand how to acheive the abstract goal, but by breaking it into smaller, more specific requirements, we make it much easier to understand how to solve the problem.

Design a Solution

Now that we know what the specific requirements are, we can start designing a solution and researching techniques to satisfy each of the requirements. Before we can start coding, we need to break the problem down into tasks based on the the specific requirements that we just discovered.

For example, when researching how to capture images from the camera, we might come across the article Taking still photos with getUserMedia.

Similarly, our research into identifying objects in an image would reveal that a popular way of accomplishing this is via artificial intelligence and and that there are a variety of free models such as the free tensorflow.js object detection model that we can use.

To add labels to the image we capture, we will find that the canvas element will allow us to do so easily.

Finally, to make the app available, the easiest way to do so is via a cloud provider and Firebase is incredibly easy to get started.

Thus we can solve the problem with the following steps:

Create a Firebase app that we will gradually build out
Update the app to capture images like in Taking still photos with getUserMedia
Add the AI model to detect objects in the captured images using the free object detection model
Finally, use predictions from the model and the captured image to draw the image with labels on the canvas element

Now that we have a design, we can start coding our app!

Get Ready to Code

Fork this REPL
Clean up any previous solutions with the provided script; just run npm run step-0

Setup Firebase

To serve and eventually host our app, we will use Firebase.

Firebase provides access to many of the same resource and services as Google Cloud Platform with a few differences

Firebase can be uses without a credit card. Both Google Cloud Platform and Firebase offer free trials, but Google Cloud Platform requires a credit card to start.
Firebase offers a small subset of the resources and services that Google Cloud Platform does, but those that it does offer are what is most commonly used for simple applications.

Firebase is ideal for experimental or early stage projects.

Visit and login to Firebase (may need to login with Google credentials).
Create a project
View setup instructions
Setup hosting via the CLI 1. login to firebase from the command line by running firebase login --no-localhost, you could also run npm run step-1
1. initialize firebase by running firebase init, you could also fun npm run step-2
Serve the default app: firebase serve -o 0.0.0.0 (or npm run step-3), after a few seconds the window should appear automatically.
Deploy the default app: firebase deploy (or npm run step-4)
Visit your app running on the web at the "Hosting URL"

Capture Images

We will base our app on the excellent article on MDN about Taking still photos with getUserMedia.

This article has great explanations of the concepts used as well as source code that we can download to run the app ourselves.

I encourage everyone to have a look at the article for more information once we are done.

For now, we will simply copy the source code and use it as the foundation of our app using the provided script.

Run npm run step-5 this will replace the default app
Run firebase serve -o 0.0.0.0 (or npm run step-6) to see the new app
Run firebase deploy (or npm run step-7) to deploy the new version
Visit your app at the hosting url again to see the image capture app

Add Machine Learning

You can access a variety of pretrained models from https://www.tensorflow.org/js/models which are ready to use. For our application we will use the object detection model

Add the required scripts to public/index.html to load the model:

<!-- Load TensorFlow.js. This is required to use coco-ssd model. -->
<script src="https://cdn.jsdelivr.net/npm/@tensorflow/tfjs"></script>
<!-- Load the coco-ssd model. -->
<script src="https://cdn.jsdelivr.net/npm/@tensorflow-models/coco-ssd"></script>

Update the Javascript code to use the model and identify objects in the image. Add the following changes to public/capture.js:

Change #1: Create a function to load the machine learning model and store it for later

var model = null;

function getModel() {
  return new Promise((resolve) => {
    if (model) {
      resolve(model);
    }
    cocoSsd.load().then((loadedModel) => {
      model = loadedModel;
      resolve(model);
    });
  });
}

Change #2: Load the machine learning model and store it for later
- ```
getModel();
```

Change #3: Use the machine learning model to predict the objects in the picture and label them in the picture

var img = document.getElementById('photo');
getModel().then((model) => {
  // detect objects in the image.
  model.detect(img).then((predictions) => {
    console.log('Predictions: ', predictions);
    context.font = '24px serif';
    context.strokeStyle = 'green';
    context.fillStyle = 'green';

    for (const prediction of predictions) {
      const [x, y, height, width] = prediction.bbox;
      context.strokeRect(x, y, height, width);
      context.fillText(prediction.class, x, y);
    }

    const data = canvas.toDataURL('image/png');
    photo.setAttribute('src', data);
  });
});

Run firebase serve to see the new version running locally
Run firebase deploy to deploy the new vewsion when it's working
Visit the app running on the hosted url
Share the app you built, and have some fun!

Extra Credit

Make the app run continuously to identify objects whenever they are in view of the camera Manipulating video using canvas - Web APIs | MDN
Make the app your own, with custom style, layout, and information
Make the app responsive (so that it works well on a phone)
Show an alert whenever an interesting object is detected.

Other Resources

Access this document online
- URL: https://github.com/albertpatterson/AI-Camera
- Scan this QR code with your phone

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
doc		doc
example		example
step		step
.firebaserc		.firebaserc
.gitignore		.gitignore
README.md		README.md
README_PRINTER_FRIENDLY.md		README_PRINTER_FRIENDLY.md
firebase.json		firebase.json
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Make Your Own Artificial Intelligence Driven Web App

Discover the Requirements

Design a Solution

Get Ready to Code

Setup Firebase

Capture Images

Add Machine Learning

Extra Credit

Other Resources

Notes

About

Releases

Packages

Languages

albertpatterson/AI-Camera

Folders and files

Latest commit

History

Repository files navigation

Make Your Own Artificial Intelligence Driven Web App

Discover the Requirements

Design a Solution

Get Ready to Code

Setup Firebase

Capture Images

Add Machine Learning

Extra Credit

Other Resources

Notes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages