Details are organized in LeNet-5.pdf.
python
- Train the original model and the light model.
- Get the data, such as weights and biases, from the trained models.
- Get the input data to be tested and the corresponding answers.
data
- Created by `/python/*_training.ipynb` and `/python/get_data_*.ipynb`.
- Trained weights and biases are stored as text (`*.data`); a loading sketch follows this list.
- Weights and biases for the original model are in `/data/origin`.
- Weights and biases for the light model are in `/data/lite`.
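For reference, a minimal sketch of how such a text-based `*.data` file could be loaded on the C++ side. The one-value-per-whitespace-separated-token layout and the helper name `loadData` are assumptions for illustration, not necessarily how the repository's code reads the files.

```cpp
#include <fstream>
#include <string>
#include <vector>

// Minimal sketch: read whitespace-separated floating-point values from a
// text *.data file (e.g. a weight file under /data/origin) into a vector.
// File names and layout are assumptions for illustration only.
static std::vector<float> loadData(const std::string &path) {
    std::ifstream in(path);
    std::vector<float> values;
    float v;
    while (in >> v) {        // stops at EOF or on a malformed token
        values.push_back(v);
    }
    return values;
}
```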
cplusplus
- Implementations for each model and data type (a data-type sketch follows this list):
- `origin`: Original model with floating-point data type.
- `floating-point`: Light model with floating-point data type.
- `fixed-point`: Light model with fixed-point data type.
- `hls-stream`: Light model with the `hls::stream` data type.
- `hls-parallel`: Partially parallel model with the `hls::stream` data type.
- `etc`: Analyzes results, saves the input txt file as a binary, determines the number of bits in the fixed-point integer part, or generates weight arrays.
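The main difference between the `floating-point` and `fixed-point` variants is the arithmetic type. Below is a minimal sketch of how the two could be switched with a single typedef; the `ap_fixed<16, 5>` width (16 bits total, 5 integer bits) and the `USE_FIXED_POINT` macro are illustrative assumptions, not the configuration used in this repository.

```cpp
#include <ap_fixed.h>   // HLS arbitrary-precision fixed-point types

// Illustrative only: total width and integer width are assumptions.
// ap_fixed<W, I> has W bits in total, I of them for the integer part
// (the integer width would come from the max/min analysis in etc/).
#ifdef USE_FIXED_POINT
typedef ap_fixed<16, 5> dtype;   // light model, fixed-point variant
#else
typedef float dtype;             // original / floating-point variants
#endif

// Every layer then computes on dtype instead of hard-coding float:
static dtype mac(const dtype *w, const dtype *x, int n) {
    dtype acc = 0;
    for (int i = 0; i < n; ++i)
        acc += w[i] * x[i];
    return acc;
}
```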
hls
- `dtype`: Compares floating point and fixed point in terms of latency, resources, and maximum frequency using a simple function.
- `predict`: Partially parallel model that can be synthesized with HLS; the top function name is `predict`. A sketch of such a top function follows this list.
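As an illustration of the `hls::stream`-based `predict` interface, here is a minimal sketch of a top function with AXI4-Stream ports. The payload type, pragmas, and image/output sizes are assumptions and are not copied from the repository's source.

```cpp
#include <ap_axi_sdata.h>
#include <hls_stream.h>

// Hypothetical stream payload: 32-bit data beats with minimal side channels.
typedef ap_axis<32, 1, 1, 1> axis_t;

// Sketch of an HLS top function named `predict` with AXI4-Stream ports,
// the kind of interface an AXI DMA can drive. Sizes are placeholders for a
// LeNet-5-style network (32x32 input, 10 class scores).
void predict(hls::stream<axis_t> &in_stream, hls::stream<axis_t> &out_stream) {
#pragma HLS INTERFACE axis port=in_stream
#pragma HLS INTERFACE axis port=out_stream
#pragma HLS INTERFACE s_axilite port=return   // block-level control, exposes ap_done

    float image[32 * 32];
    union { int i; float f; } conv;

    for (int i = 0; i < 32 * 32; ++i) {
#pragma HLS PIPELINE II=1
        axis_t beat = in_stream.read();
        conv.i = beat.data.to_int();          // assume float pixels packed in 32 bits
        image[i] = conv.f;
    }

    float scores[10] = {0};                   // placeholder result buffer
    // ... convolution, pooling, and fully connected layers would run here ...

    for (int i = 0; i < 10; ++i) {
#pragma HLS PIPELINE II=1
        axis_t beat;
        conv.f = scores[i];
        beat.data = conv.i;
        beat.keep = -1;                       // all bytes valid
        beat.strb = -1;
        beat.last = (i == 9);                 // assert TLAST on the final beat
        out_stream.write(beat);
    }
}
```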
arm
- SW driver code that reads the input, checks accuracy, and measures latency.
- Reads the input binary file from the SD card.
- Runs 10,000 test cases to measure accuracy.
- Uses the AXI Timer IP on the PL (Programmable Logic) side to measure latency (a timer sketch follows this list).
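A minimal sketch of how latency could be measured from the PS side with the standalone `XTmrCtr` driver for the AXI Timer. The device-ID macro is a placeholder that depends on the block design, and the repository's driver code may wrap these calls differently.

```cpp
#include "xparameters.h"
#include "xtmrctr.h"
#include "xil_printf.h"

// Placeholder device ID: the exact macro comes from xparameters.h for the
// block design in BlockDesign.pdf (an axi_timer instance on the PL).
#define TIMER_DEVICE_ID  XPAR_TMRCTR_0_DEVICE_ID

static XTmrCtr Timer;

// Measure how many PL clock ticks one inference takes.
void measureLatencyExample(void) {
    XTmrCtr_Initialize(&Timer, TIMER_DEVICE_ID);

    XTmrCtr_Reset(&Timer, 0);     // timer/counter 0 of the AXI Timer
    XTmrCtr_Start(&Timer, 0);

    // ... start the DMA transfer and wait for the predict IP here ...

    XTmrCtr_Stop(&Timer, 0);
    u32 ticks = XTmrCtr_GetValue(&Timer, 0);

    // Divide by the AXI Timer clock frequency (from xparameters.h) to get seconds.
    xil_printf("latency: %d ticks\r\n", (int)ticks);
}
```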
etc
- `waveform`: Waveform captured with the Integrated Logic Analyzer (ILA).
- `BlockDesign.pdf`: Block design of the programmable logic.
- `LeNet-5.pdf`: Presentation PDF that summarizes the content of this LeNet-5 project.
python
- TensorFlow (2.10.0) must be installed in advance.
- Since the files have the `*.ipynb` extension, installing the `Jupyter` extension is recommended if you use VS Code.
- If you run all the `*.ipynb` files, the `/data` folder will be generated.
cplusplus
- Each directory can be compiled with the command `make`.
- In most cases, it can be run with the command `./main`.
- Exceptionally, there are three options for the `floating-point` directories (a sketch of the option parsing follows this list).
  - Run just one input.
    - `./main --input ../../data/input_N.data` or `./main -i ../../data/input_N.data`
  - Run one input and print intermediate results.
    - `./main --input ../../data/input_N.data --print` or `./main -i ../../data/input_N.data -p`
  - Run all (check accuracy and find the max/min values of intermediate outputs, weights, and biases).
    - `./main --all` or `./main -a`
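A minimal sketch of how these three options could be parsed with `getopt_long`; this only illustrates the command-line behavior described above and is not the repository's actual `main` implementation.

```cpp
#include <getopt.h>
#include <cstdio>

int main(int argc, char **argv) {
    const char *inputPath = nullptr;
    bool printIntermediate = false;
    bool runAll = false;

    // Long/short options matching the commands shown above.
    static const struct option longOpts[] = {
        {"input", required_argument, nullptr, 'i'},
        {"print", no_argument,       nullptr, 'p'},
        {"all",   no_argument,       nullptr, 'a'},
        {nullptr, 0, nullptr, 0}
    };

    int c;
    while ((c = getopt_long(argc, argv, "i:pa", longOpts, nullptr)) != -1) {
        switch (c) {
        case 'i': inputPath = optarg;        break;
        case 'p': printIntermediate = true;  break;
        case 'a': runAll = true;             break;
        default:
            std::fprintf(stderr, "usage: ./main [-i file] [-p] [-a]\n");
            return 1;
        }
    }

    // ... run one input (optionally printing intermediate results) or run all cases ...
    (void)inputPath; (void)printIntermediate; (void)runAll;
    return 0;
}
```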
- The AXI DMA provides high-speed data movement between system memory and an AXI4-Stream based target IP.
- The Integrated Logic Analyzer (ILA) IP core is a logic analyzer core that can be used to monitor the internal signals of a design.
- The AXI Timer provides an AXI4-Lite interface to communicate with the PS (Processing System).
- An interrupt is raised when the `ap_done` block-level interface signal is asserted High (see the interrupt setup sketch below).
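A minimal sketch of how such an `ap_done` interrupt could be hooked up on the PS side with the `XScuGic` driver. The interrupt-ID macro is hypothetical and depends on the block design; the repository's driver may organize this differently.

```cpp
#include "xparameters.h"
#include "xscugic.h"
#include "xil_exception.h"

static XScuGic Gic;
volatile int predictDone = 0;

// Called when the predict IP raises its interrupt (ap_done asserted High).
static void predictIsr(void *ref) {
    (void)ref;
    predictDone = 1;
    // A real driver would also clear the IP's interrupt status register here.
}

// Hypothetical interrupt ID: the real macro name comes from xparameters.h.
#define PREDICT_IRQ_ID  XPAR_FABRIC_PREDICT_0_INTERRUPT_INTR  /* assumption */

int interruptInit(void) {
    XScuGic_Config *cfg = XScuGic_LookupConfig(XPAR_SCUGIC_SINGLE_DEVICE_ID);
    if (cfg == NULL) return XST_FAILURE;
    if (XScuGic_CfgInitialize(&Gic, cfg, cfg->CpuBaseAddress) != XST_SUCCESS)
        return XST_FAILURE;

    XScuGic_Connect(&Gic, PREDICT_IRQ_ID, predictIsr, NULL);
    XScuGic_Enable(&Gic, PREDICT_IRQ_ID);

    // Route GIC interrupts to the ARM exception handler and enable them.
    Xil_ExceptionRegisterHandler(XIL_EXCEPTION_ID_INT,
                                 (Xil_ExceptionHandler)XScuGic_InterruptHandler, &Gic);
    Xil_ExceptionEnable();
    return XST_SUCCESS;
}
```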
- Initialize the AXI DMA IP and the predict IP through `dmaInit()` and `predictInit()`.
- Flush the cache before transferring data via DMA through `cacheFlush()`.
- The AXI DMA IP reads data from DDR and transfers it to the predict IP through `dataTx()`.
- Wait for the predict IP to finish processing, and read the result when the interrupt signal is raised (the per-input flow is sketched below).
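A minimal sketch of what this flow could look like on top of the standard Xilinx `XAxiDma` and cache APIs. The helper names (`dmaInit()`, `cacheFlush()`, `dataTx()`) come from the list above, but their bodies here are assumptions, as are the device-ID macro and buffer sizes.

```cpp
#include "xparameters.h"
#include "xaxidma.h"
#include "xil_cache.h"
#include "xstatus.h"

static XAxiDma Dma;
extern volatile int predictDone;   // set by the ap_done interrupt handler sketched above

// Sketch of dmaInit(): body is an assumption built on the standard XAxiDma API.
int dmaInit(void) {
    XAxiDma_Config *cfg = XAxiDma_LookupConfig(XPAR_AXIDMA_0_DEVICE_ID);
    if (cfg == NULL) return XST_FAILURE;
    return XAxiDma_CfgInitialize(&Dma, cfg);
    // predictInit() would similarly configure the HLS predict IP via its driver.
}

// Sketch of cacheFlush(): make the input buffer in DDR visible to the DMA.
void cacheFlush(const float *buf, unsigned lenBytes) {
    Xil_DCacheFlushRange((UINTPTR)buf, lenBytes);
}

// Sketch of dataTx(): stream one input image from DDR to the predict IP.
int dataTx(const float *image, unsigned lenBytes) {
    return XAxiDma_SimpleTransfer(&Dma, (UINTPTR)image, lenBytes, XAXIDMA_DMA_TO_DEVICE);
}

// Per-input flow matching the list above: flush, send, then wait for the interrupt.
int runOne(const float *image, unsigned lenBytes) {
    predictDone = 0;
    cacheFlush(image, lenBytes);
    if (dataTx(image, lenBytes) != XST_SUCCESS) return XST_FAILURE;
    while (!predictDone) { /* wait for the ap_done interrupt */ }
    // ... read the class scores back (e.g. via a second DMA channel or AXI4-Lite) ...
    return XST_SUCCESS;
}
```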
- The predict IP on the PL is 40.34x faster than the original model compiled with -O0 on the PS.
- The predict IP on the PL is 7.38x faster than the original model compiled with -O2 on the PS.
- The predict IP on the PL is 16.01x faster than the lite model compiled with -O0 on the PS.
- The predict IP on the PL is 1.57x faster than the lite model compiled with -O3 on the PS.
- SDK issue [Closed] Issue #1