JustNoise

Classroom Occupancy and Noise Analytics

This project contains firmware and software for an IoT system that senses classroom audio, occupancy and environment, builds a noise profile, and issues actuation decisions.

NEW: WAV Audio Recording Over Serial - The ESP32 can now record 10-second WAV files and stream them directly to your computer over USB. See the "Serial Mode (USB)" section below.

NEW: Real-time Voice Detection - Silero VAD integration for live speech detection, with a visual CLI monitor. See the "Voice Activity Detection (VAD)" section below.

See AGENTS.md for complete project details.

Directories

- esp32/firmware: PlatformIO project for the ESP32 sensor node firmware (MQTT mode).
- arduino/mictest: Simplified ESP32 sketch for WAV file recording over serial.
- pi-aggregator: Raspberry Pi aggregator code that maintains noise profiles and publishes metadata.
- pi-decision: Raspberry Pi decision node with an ML model that predicts noisiness and publishes actuation commands.
- shared: Shared MQTT topics, schemas, and utilities.
- scripts: Helper scripts, including the WAV capture utility.

Audio Streaming (Serial & WiFi TCP)

The ESP32 supports two streaming modes: serial over USB and TCP over WiFi.

Serial Mode (USB)

Quick Start:

Hardware Setup:
- Connect MAX4466 microphone to ESP32 GPIO 35
- Connect ESP32 to computer via USB

Flash Firmware:

cd arduino/mictest
pio run -t upload --upload-port /dev/tty.wchusbserial550D0193611  # macOS
# or /dev/ttyUSB0 on Linux

Capture Audio:

uv run scripts/capture_wav.py /dev/tty.wchusbserial550D0193611 recording.wav

Play Recording:

afplay recording.wav  # macOS
# or aplay recording.wav on Linux

WiFi TCP Mode (Network Streaming)

Features:

- Automatic WiFi connection to "yours" network
- TCP streaming to server at 10.45.232.125:8080
- Real-time audio streaming over network
- Configurable gain via TCP commands

Setup:

1. Flash the firmware (same as in Serial Mode above).
2. The ESP32 automatically connects to the WiFi network "yours" with password "yours123".
3. Start streaming by sending the 'S' command over serial.
4. The server receives raw PCM audio at 10.45.232.125:8080.

Commands:

- S - Start TCP streaming
- T - Stop streaming
- G0-G4 - Set gain (0=1x, 1=2x, 2=4x, 3=8x, 4=16x)
- I - Show status
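
The command set above is small enough to validate on the host before anything is written to the port. The following is a minimal sketch, not the project's own tooling: `build_command` is a hypothetical helper, and it assumes commands are sent as bare ASCII characters with no line terminator, which the firmware may or may not require.

```python
from typing import Optional


def build_command(name: str, gain_level: Optional[int] = None) -> bytes:
    """Return the ASCII bytes for one control command (S, T, I, or G0-G4).

    Assumption: no newline or other terminator is appended.
    """
    name = name.upper()
    if name in ("S", "T", "I"):
        if gain_level is not None:
            raise ValueError(f"command {name!r} does not take a gain level")
        return name.encode("ascii")
    if name == "G":
        # Only the documented levels 0 through 4 are accepted.
        if not isinstance(gain_level, int) or not 0 <= gain_level <= 4:
            raise ValueError("gain level must be an integer from 0 to 4")
        return f"G{gain_level}".encode("ascii")
    raise ValueError(f"unknown command {name!r}")
```

With pyserial, the result could be written with something like `serial.Serial(port, baudrate).write(build_command("G", 2))`; the baud rate is not stated in this section and would need to match the firmware.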

Output:

- Format: Raw 16-bit mono PCM (no WAV header)
- Sample Rate: 16 kHz
- Bitrate: ~256 kbps
- Protocol: TCP stream to configured server
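
Because the stream carries no WAV header, the receiver has to supply the format itself. The sketch below uses only the Python standard library to check the bitrate figure and to wrap received bytes in a WAV container. It is an illustration under assumptions, not the project's server: `receive_pcm` and `pcm_to_wav_bytes` are hypothetical names, the samples are assumed to be little-endian, and the ESP32 is assumed to open the connection to a listening server on port 8080.

```python
import io
import socket
import wave

SAMPLE_RATE_HZ = 16_000   # from the format description above
SAMPLE_WIDTH_BYTES = 2    # 16-bit samples
CHANNELS = 1              # mono


def bitrate_bps() -> int:
    """Bits per second of the raw stream: rate x bits per sample x channels."""
    return SAMPLE_RATE_HZ * SAMPLE_WIDTH_BYTES * 8 * CHANNELS


def pcm_to_wav_bytes(pcm: bytes) -> bytes:
    """Prepend a WAV header to raw PCM so ordinary players can open it."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(CHANNELS)
        wav.setsampwidth(SAMPLE_WIDTH_BYTES)
        wav.setframerate(SAMPLE_RATE_HZ)
        wav.writeframes(pcm)
    return buffer.getvalue()


def receive_pcm(port: int = 8080, seconds: float = 3.0) -> bytes:
    """Accept one connection and read roughly `seconds` of audio from it."""
    frame_bytes = SAMPLE_WIDTH_BYTES * CHANNELS
    wanted = int(SAMPLE_RATE_HZ * seconds) * frame_bytes
    chunks = []
    received = 0
    with socket.create_server(("", port)) as server:
        connection, _address = server.accept()
        with connection:
            while received < wanted:
                data = connection.recv(4096)
                if not data:  # sender closed the connection
                    break
                chunks.append(data)
                received += len(data)
    return b"".join(chunks)[:wanted]
```

A capture could then be saved with `open("capture.wav", "wb").write(pcm_to_wav_bytes(receive_pcm()))`, where `capture.wav` is just an example file name.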

Microphone Gain Troubleshooting

Problem: Audio sounds extremely loud and distorted ("earrape").

Solution: The ESP32 firmware defaults to 16x gain (G4), which may be too high for your microphone setup.

Quick Fix:

# Try a lower gain first
just mic-gain-test 3  # Test with 8x gain

# Set lower gain if still too loud
just mic-gain-set 2   # 4x gain (recommended for most setups)
just mic-gain-set 1   # 2x gain (for very sensitive microphones)
just mic-gain-set 0   # 1x gain (minimum, for loud environments)

Gain Levels:

- G0: 1x gain (minimum amplification)
- G1: 2x gain (minimal)
- G2: 4x gain (light - recommended for most setups)
- G3: 8x gain (medium)
- G4: 16x gain (high - default, may cause distortion)
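
Each level doubles the previous one, so level n corresponds to a multiplier of 2^n. The sketch below illustrates why the highest level can distort: it assumes gain is a plain multiplication of signed 16-bit samples followed by saturation, which may not be exactly what the firmware does. The function names are hypothetical.

```python
from typing import List, Sequence

INT16_MIN, INT16_MAX = -32768, 32767


def gain_multiplier(level: int) -> int:
    """Map a gain level (0-4) to its multiplier (1x, 2x, 4x, 8x, 16x)."""
    if not 0 <= level <= 4:
        raise ValueError("gain level must be between 0 and 4")
    return 2 ** level


def apply_gain(samples: Sequence[int], level: int) -> List[int]:
    """Multiply each sample and clamp it to the signed 16-bit range."""
    multiplier = gain_multiplier(level)
    return [max(INT16_MIN, min(INT16_MAX, s * multiplier)) for s in samples]


def clipped_fraction(samples: Sequence[int], level: int) -> float:
    """Fraction of samples that would exceed the 16-bit range at this level."""
    if not samples:
        return 0.0
    multiplier = gain_multiplier(level)
    clipped = sum(
        1 for s in samples if not INT16_MIN <= s * multiplier <= INT16_MAX
    )
    return clipped / len(samples)
```

Under these assumptions, a sample of 3000 stays intact at level 2 (12000) but saturates at level 4 (48000 is clamped to 32767), which is heard as harsh distortion.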

Testing Audio Quality:

# Capture a test recording
just capture-pcm-duration 3

# Play it back
just play-last

# Adjust gain and repeat until audio sounds clear

Note: Gain can be adjusted in real-time during streaming. Send 'G' followed by the gain level (0-4) to change gain on-the-fly.

Voice Activity Detection (VAD)

Quick Start

Install VAD dependencies:

just setup-vad  # Installs PyTorch + Silero VAD

Test installation:
```
just test-vad
```

Live vocal monitoring (CLI debugger):

# Single recording (10 seconds)
just vad-monitor

# Continuous mode (loops indefinitely)
just vad-monitor-continuous

This displays real-time visual alerts when vocals are detected:

🔴 Big alerts when speech starts
🟢 Notification when speech ends
Progress bars showing confidence during speech
Session summary with statistics
🔄 Continuous mode keeps monitoring across multiple recordings

Example output:

🗣️  🗣️  🗣️  🗣️  🗣️  🗣️  🗣️  🗣️  🗣️  🗣️  
[12:34:56.789] 🔴 VOCALS DETECTED - SPEECH STARTED!
🗣️  🗣️  🗣️  🗣️  🗣️  🗣️  🗣️  🗣️  🗣️  🗣️  

🔴 SPEECH [████████████████████░░░░░░░░░░░░░░░░] 65.3%
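
The start and end alerts can be thought of as a two-state machine driven by per-frame speech probabilities. The following is a simplified sketch of that idea, not the monitor's actual code: the 0.5 threshold, the bar width, and the function names are assumptions chosen for illustration.

```python
from typing import Iterable, Iterator, Tuple


def speech_events(
    probabilities: Iterable[float], threshold: float = 0.5
) -> Iterator[Tuple[int, str]]:
    """Yield (frame_index, "start" or "end") whenever the speech state flips."""
    speaking = False
    for index, probability in enumerate(probabilities):
        if probability >= threshold and not speaking:
            speaking = True
            yield index, "start"
        elif probability < threshold and speaking:
            speaking = False
            yield index, "end"


def confidence_bar(probability: float, width: int = 36) -> str:
    """Render a probability in [0, 1] as a fixed-width text bar."""
    probability = max(0.0, min(1.0, probability))
    filled = int(probability * width)
    return "█" * filled + "░" * (width - filled)
```

For example, `list(speech_events([0.1, 0.7, 0.8, 0.2]))` reports a start at frame 1 and an end at frame 3. A real monitor would likely add smoothing so that a single low-probability frame does not end a speech segment.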

MQTT mode (production):

just vad-live  # Publishes events to MQTT

Features

✅ Privacy-first: No raw audio transmitted, only metadata
✅ Low latency: ~32ms detection delay
✅ Accurate: Silero VAD model (state-of-the-art)
✅ Visual feedback: CLI monitor for debugging

See pi-aggregator/VAD_README.md for complete documentation.
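
The ~32 ms latency figure is consistent with analysing audio in frames of 512 samples at 16 kHz, the window size Silero VAD uses at that sample rate. Whether this project frames audio exactly this way is an assumption. The sketch below shows the arithmetic and one way to cut a raw 16-bit PCM buffer into such frames; the names are hypothetical.

```python
from typing import Iterator

SAMPLE_RATE_HZ = 16_000
FRAME_SAMPLES = 512        # assumed VAD window size at 16 kHz
BYTES_PER_SAMPLE = 2       # 16-bit mono PCM


def frame_duration_ms(
    frame_samples: int = FRAME_SAMPLES, sample_rate_hz: int = SAMPLE_RATE_HZ
) -> float:
    """Duration of one analysis frame in milliseconds."""
    return 1000.0 * frame_samples / sample_rate_hz


def split_frames(pcm: bytes, frame_samples: int = FRAME_SAMPLES) -> Iterator[bytes]:
    """Yield consecutive whole frames; a trailing partial frame is dropped."""
    frame_bytes = frame_samples * BYTES_PER_SAMPLE
    for offset in range(0, len(pcm) - frame_bytes + 1, frame_bytes):
        yield pcm[offset:offset + frame_bytes]
```

A detector cannot report speech before it has seen a full frame, so the frame duration sets a lower bound on detection delay.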

MQTT Mode (IoT System)

Quick start (development):

ESP32: Use PlatformIO to build and flash esp32/firmware.
Python components: This project uses uv for dependency management.
1. Install uv: curl -LsSf https://astral.sh/uv/install.sh | sh (or via brew/pip).
2. Sync dependencies: uv sync.

Local test (no hardware):

1. Run a local MQTT broker (mosquitto): brew install mosquitto, then run mosquitto.
2. In one terminal, run the aggregator: uv run pi-aggregator/aggregator.py.
3. In another terminal, run the decision node: uv run pi-decision/decision.py.
4. Use the included simulator to publish sample ESP32 messages: uv run scripts/publish_sample.py.
