This is an exploratory project built around YOLO, a state-of-the-art real-time object detection model.
- Haoxuan Wang @thomaswang0822
- Jingyu Wu @leowubj
We include the video detection feature!

- To look at a demo result, please visit our web-app home page.
- To play around with your own video locally, change the directory and filename in video_runner.py and run it.
- You can also try it out with our web app: in the Video tab, upload your own .mp4 video and run video object detection with a single click! Note: there is a rather technical compatibility issue between the browser video player and the video encoder types OpenCV offers (such as H264), so we use the .webm format as a workaround (see the writer sketch after this list). This increases the video writing time (numpy ndarray -> video), but model efficiency is not affected.
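For reference, here is a minimal sketch of the .webm writing step, assuming the VP8 codec (fourcc "VP80") and made-up file names; the real logic lives in video_runner.py and the web-app backend.

```python
import cv2

# Assumed input/output paths -- replace with your own files.
in_path, out_path = "input.mp4", "output.webm"

cap = cv2.VideoCapture(in_path)
fps = cap.get(cv2.CAP_PROP_FPS) or 30
width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))

# VP8 ("VP80") is a codec that browsers can play from a .webm container.
writer = cv2.VideoWriter(out_path, cv2.VideoWriter_fourcc(*"VP80"), fps, (width, height))

while True:
    ok, frame = cap.read()   # frame is a numpy ndarray of shape (H, W, 3)
    if not ok:
        break
    # ... run YOLO detection on `frame` and draw bounding boxes here ...
    writer.write(frame)

cap.release()
writer.release()
```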
- Set up the Python environment with GPU enabled.
- Choose a version of the YOLO model to use (we use v5).
- (Optional) Collect or download your custom dataset and do basic data processing.
- Train the model on your custom dataset or on a popular open-source dataset such as COCO. For the latter, the YOLO repo likely has a script that automates downloading and processing.
- Construct a detection/prediction pipeline that uses the trained model.
- Host this pipeline as a web app (a minimal hosting sketch follows this list).
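Our own web app's framework is not described here, so purely as an illustration of what "hosting the pipeline" can mean, below is a minimal sketch using Flask; the /predict route, the "image" field name, and the predict() stand-in are all hypothetical.

```python
import io

import cv2
import numpy as np
from flask import Flask, request, send_file

app = Flask(__name__)

def predict(image):
    """Hypothetical stand-in for the real predictor (see prediction.py)."""
    # ... run the YOLO model and draw bounding boxes on `image` ...
    return image

@app.route("/predict", methods=["POST"])
def predict_route():
    # Decode the uploaded image into a numpy ndarray.
    data = np.frombuffer(request.files["image"].read(), dtype=np.uint8)
    image = cv2.imdecode(data, cv2.IMREAD_COLOR)
    result = predict(image)
    # Encode the annotated image back to PNG and return it.
    ok, buf = cv2.imencode(".png", result)
    return send_file(io.BytesIO(buf.tobytes()), mimetype="image/png")

if __name__ == "__main__":
    app.run(port=5000)
```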
If you only want to see how object detection works, you can navigate to our web app directly and upload your image.

If you want to explore our project further, for example to:

- switch to a different version instead of v5
- train on a dataset of your choice
- choose a different model size (see this)
- dig deeper into YOLO itself

here is a brief guide on how to get everything working.
First, set up the Python environment with GPU enabled (you can check this in TensorFlow or PyTorch; a minimal check is sketched below).
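A minimal sketch of that check, assuming PyTorch (and optionally TensorFlow) was installed with CUDA support:

```python
import torch

# Should print True in a correctly configured GPU environment.
print("PyTorch sees a GPU:", torch.cuda.is_available())

# Equivalent check if you use TensorFlow instead:
# import tensorflow as tf
# print("TensorFlow sees a GPU:", tf.config.list_physical_devices("GPU"))
```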
Clone our repo (or make your own, store your custom dataset, and write the corresponding image-processing scripts).

Writing the image-processing scripts isn't hard; you can refer to our prepare.ipynb. It has enough comments, and very likely you only need to change paths and filenames. The hardest part is manually labeling datasets that were not designed for object detection: you have to draw the bounding boxes (bbox) and type the object class for each image. Labeling can be done with the labelImg package, which is included in our requirements.txt; the resulting YOLO label format is sketched below.
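For orientation, here is what the YOLO label format boils down to, together with a small helper that converts a pixel-space box into it. The class id, box coordinates, and image size below are made-up example values; labelImg can write this format for you directly.

```python
# Each image gets a .txt file with one line per object:
#   <class_id> <x_center> <y_center> <width> <height>
# where the four box values are normalized to [0, 1] by the image size.

def to_yolo_label(class_id, x_min, y_min, x_max, y_max, img_w, img_h):
    """Convert a pixel-space bounding box to a YOLO-format label line."""
    x_center = (x_min + x_max) / 2 / img_w
    y_center = (y_min + y_max) / 2 / img_h
    width = (x_max - x_min) / img_w
    height = (y_max - y_min) / img_h
    return f"{class_id} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}"

# Example: a box from (50, 80) to (250, 180) in a 640x480 image, class 0.
print(to_yolo_label(0, 50, 80, 250, 180, 640, 480))
```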
```bash
cd my-project                    # make sure you enter our repo
git clone https://github.com/ultralytics/yolov5.git
pip install -r requirements.txt  # our requirements.txt: packages needed for our project

cd yolo-repo                     # i.e. the cloned yolov5 folder
pip install -r requirements.txt  # YOLOv5's requirements.txt: packages for playing around with YOLO
```

Next, create a dataset configuration file; for its contents, you can look at our data.yaml (if you use our data and thus our data.yaml, please remember to move it inside the yolo folder). Essentially, data.yaml tells YOLO where the training data and test data are and which object classes they contain.
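As an illustration only (the paths and class names here are made up; check our actual data.yaml), the file boils down to a small mapping like this, which you could also generate from Python with PyYAML:

```python
import yaml  # PyYAML

# Hypothetical dataset description -- replace paths and class names with your own.
data_config = {
    "train": "../datasets/my-dataset/images/train",  # training images
    "val": "../datasets/my-dataset/images/val",      # validation images
    "nc": 2,                                         # number of classes
    "names": ["cat", "dog"],                         # class names, index = class id
}

with open("data.yaml", "w") as f:
    yaml.safe_dump(data_config, f, sort_keys=False)
```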
Since YOLO is a fairly complex and delicate model, we suggest running a short test training first to make sure everything is error-free before you proceed to the real training (which could take several hours).
```bash
cd yolo-repo

# test training
python train.py --batch-size 8 --epochs 5 --data data.yaml --name <TestModel> --cfg yolov5s.yaml --patience 5

# real training, from scratch
python train.py --batch-size 8 --epochs 100 --data data.yaml --name <model name> --cfg <choice of model size>.yaml --patience 5

# real training, starting from a pretrained model
python train.py --batch-size 8 --epochs 100 --data data.yaml --name <model name> --weights <choice of model size>.pt --patience 5
```

NOTE: As mentioned at the beginning of this section, YOLOv5 (and other YOLO versions as well) comes in different model sizes.
In short, a different model size means a different configuration and parameter count, but a larger model doesn't guarantee better performance. In our experiments, using a larger model on a relatively small dataset could lead to worse performance or even failure to converge.
```bash
# still inside the yolo dir
cd runs/train/<TestModel>
```

This folder contains various training and validation metrics. In particular, results.png tells you whether the training went well. For the other metrics, you can search for them on Google or on the YOLOv5 official site.
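If you prefer to look at the raw numbers, recent YOLOv5 releases also write a results.csv next to results.png; a quick way to inspect the last few epochs, assuming pandas is installed (the run name is an example):

```python
import pandas as pd

# Path is an example -- substitute your own run name.
df = pd.read_csv("runs/train/TestModel/results.csv")
df.columns = df.columns.str.strip()  # column names come padded with spaces

# Show losses and mAP metrics for the final epochs.
print(df.tail())
```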
Export the trained model to ONNX format:

```bash
# still inside the yolo dir
python export.py --weights runs/train/<model name>/weights/best.pt --include onnx --simplify

# either move best.onnx to ROOT or change the path in pred_runner.py (see the section below)
mv runs/train/<model name>/weights/best.onnx ../
```

We built the prediction pipeline as a class (called Yolo_Predictor) in prediction.py.
It also serves as the backend script of our web app. You are welcome to look at the implementation details, but you don't need to modify them. Instead, just edit the path variables inside pred_runner.py and run it. Two pop-up windows will appear: one shows the original image, the other shows the prediction with bounding boxes, classes, and probabilities. Press the ESC key to close them.
```bash
cd ..  # back to our repo
python pred_runner.py
```
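For readers curious about what such a pipeline does under the hood, here is a rough, simplified sketch of running a YOLOv5 ONNX model with OpenCV's DNN module and displaying the result. It is not our Yolo_Predictor implementation (see prediction.py for that); the file names, the 640x640 input size, the thresholds, and the simplified (non-letterboxed) pre-processing are assumptions for illustration.

```python
import cv2
import numpy as np

INPUT_SIZE = 640                                   # assumed YOLOv5 export size
net = cv2.dnn.readNetFromONNX("best.onnx")         # the exported model from the step above

image = cv2.imread("test.jpg")                     # hypothetical test image
h, w = image.shape[:2]

# Plain resize for simplicity; a production pipeline should letterbox to keep the aspect ratio.
blob = cv2.dnn.blobFromImage(image, 1 / 255.0, (INPUT_SIZE, INPUT_SIZE), swapRB=True)
net.setInput(blob)
preds = net.forward()[0]                           # shape: (num_candidates, 5 + num_classes)

boxes, scores, class_ids = [], [], []
for row in preds:
    obj_conf = float(row[4])
    class_scores = row[5:]
    class_id = int(np.argmax(class_scores))
    conf = obj_conf * float(class_scores[class_id])
    if conf < 0.4:                                 # confidence threshold (assumption)
        continue
    # Boxes come as (cx, cy, bw, bh) in the 640x640 input scale; map back to the image.
    cx, cy, bw, bh = row[:4]
    x = int((cx - bw / 2) * w / INPUT_SIZE)
    y = int((cy - bh / 2) * h / INPUT_SIZE)
    boxes.append([x, y, int(bw * w / INPUT_SIZE), int(bh * h / INPUT_SIZE)])
    scores.append(conf)
    class_ids.append(class_id)

# Non-maximum suppression removes overlapping duplicate boxes.
keep = np.array(cv2.dnn.NMSBoxes(boxes, scores, 0.4, 0.45)).flatten()
for i in keep:
    x, y, bw, bh = boxes[int(i)]
    cv2.rectangle(image, (x, y), (x + bw, y + bh), (0, 255, 0), 2)
    cv2.putText(image, f"class {class_ids[int(i)]}: {scores[int(i)]:.2f}",
                (x, max(y - 5, 15)), cv2.FONT_HERSHEY_SIMPLEX, 0.6, (0, 255, 0), 2)

cv2.imshow("prediction", image)
cv2.waitKey(0)                                     # press ESC (or any key) to close
cv2.destroyAllWindows()
```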