database: shut the program down immediately if we run out of disk space #2358

kcalvinalvin · 2025-04-14T14:02:00Z

Change Description

Currently btcd keeps downloading blocks and fails to verify them if the disk is out of space. We exit immediately if we detect that the disk is out of space to ensure the database is at a recoverable state later on.

Steps to Test

Tested by using a macbook with full disk and checking if the program shuts down as expected. Verified that it's also able to recover when given enough storage again.

Pull Request Checklist

Testing

Your PR passes all CI checks.
Tests covering the positive and negative (error paths) are included.
Bug fixes contain tests triggering the bug to prevent regressions.

Code Style and Documentation

The change is not insubstantial. Typo fixes are not accepted to fight bot spam.
The change obeys the Code Documentation and Commenting guidelines, and lines wrap at 80.
Commits follow the Ideal Git Commit Structure.
Any new logging statements use an appropriate subsystem and logging level.

📝 Please see our Contribution Guidelines for further guidance.

coveralls · 2025-04-14T14:05:49Z

Pull Request Test Coverage Report for Build 14848505876

Details

2 of 24 (8.33%) changed or added relevant lines in 2 files are covered.
6 unchanged lines in 1 file lost coverage.
Overall coverage increased (+0.1%) to 56.743%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
database/ffldb/blockio.go	2	12	16.67%
database/ffldb/dbcache.go	0	12	0.0%

Files with Coverage Reduction	New Missed Lines	%
peer/peer.go	6	74.23%

Totals
Change from base Build 14316804349:	0.1%
Covered Lines:	31143
Relevant Lines:	54884

💛 - Coveralls

yyforyongyu · 2025-04-14T14:49:05Z

database/ffldb/blockio.go

+			if errno, ok := pathErr.Err.(syscall.Errno); ok &&
+				errno == syscall.ENOSPC {
+
+				log.Errorf("%v. Cannot save any more blocks "+


can we also mention it's due to the disk being full so the operator can act accordingly?

Addressed in the latest commit

yyforyongyu

LGTM🌺

database/ffldb/blockio.go

Currently btcd keeps downloading blocks and fails to verify them if the disk is out of space. We exit immediately if we detect that the disk is out of space to ensure the database is at a recoverable state later on.

starius

LGTM!
I tested this in a small filesystem. Works as expected!

guggero · 2025-05-06T06:49:55Z

Just a side note here: I'm not super familiar with the internals of btcd, so not sure if the same applies here. But in lnd we avoid using os.Exit() because that prevents any defer cleanup statements to run (e.g. closing network or database connections, flushing stuff to disk, closing files and so on).

That's why we're using a ShutdownLogger instead, which causes the main Goroutine to initiate a clean shutdown whenever a message with the level Critical is logged.
Might be worth considering here.

kcalvinalvin · 2025-05-06T06:58:32Z

Just a side note here: I'm not super familiar with the internals of btcd, so not sure if the same applies here. But in lnd we avoid using os.Exit() because that prevents any defer cleanup statements to run (e.g. closing network or database connections, flushing stuff to disk, closing files and so on).

That's why we're using a ShutdownLogger instead, which causes the main Goroutine to initiate a clean shutdown whenever a message with the level Critical is logged. Might be worth considering here.

btcd has one database that handles everything and the flush happens here:

btcd/btcd.go

Lines 133 to 137 in cd05d9a

    
           defer func() { 
        
           	// Ensure the database is sync'd and closed on shutdown. 
        
           	btcdLog.Infof("Gracefully shutting down the database...") 
        
           	db.Close() 
        
           }()

That's a fair point and I did think about returning an error and letting the caller handle the out of disk error but ultimately thought this was the best way to handle things. The database only ever writes to the disk at the places this code change takes place and if that's full, there's nothing else to flush for the caller because anything that the caller tries to flush follows the same code path.

Roasbeef

LGTM 📮

yyforyongyu reviewed Apr 14, 2025

View reviewed changes

saubyk assigned kcalvinalvin Apr 14, 2025

saubyk added this to the v0.25 milestone Apr 14, 2025

saubyk added the blockchain label Apr 14, 2025

kcalvinalvin force-pushed the 2025-02-17-exit-when-running-out-of-disk-space branch from 941bb30 to 8ecdf96 Compare April 21, 2025 11:02

saubyk added the database label Apr 21, 2025

saubyk requested a review from yyforyongyu April 21, 2025 23:39

yyforyongyu approved these changes Apr 22, 2025

View reviewed changes

starius reviewed Apr 29, 2025

View reviewed changes

database/ffldb/blockio.go Outdated Show resolved Hide resolved

kcalvinalvin force-pushed the 2025-02-17-exit-when-running-out-of-disk-space branch from 8ecdf96 to cabd365 Compare May 5, 2025 23:43

kcalvinalvin requested a review from starius May 5, 2025 23:43

starius approved these changes May 6, 2025

View reviewed changes

Roasbeef approved these changes May 6, 2025

View reviewed changes

Roasbeef merged commit 1eb974a into btcsuite:master May 6, 2025
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

database: shut the program down immediately if we run out of disk space #2358

database: shut the program down immediately if we run out of disk space #2358

Uh oh!

kcalvinalvin commented Apr 14, 2025

Uh oh!

coveralls commented Apr 14, 2025 •

edited

Loading

Uh oh!

yyforyongyu Apr 14, 2025

Uh oh!

kcalvinalvin Apr 21, 2025

Uh oh!

yyforyongyu left a comment

Uh oh!

Uh oh!

starius left a comment

Uh oh!

guggero commented May 6, 2025 •

edited

Loading

Uh oh!

kcalvinalvin commented May 6, 2025

Uh oh!

Roasbeef left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

database: shut the program down immediately if we run out of disk space #2358

database: shut the program down immediately if we run out of disk space #2358

Uh oh!

Conversation

kcalvinalvin commented Apr 14, 2025

Change Description

Steps to Test

Pull Request Checklist

Testing

Code Style and Documentation

Uh oh!

coveralls commented Apr 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Test Coverage Report for Build 14848505876

Details

💛 - Coveralls

Uh oh!

yyforyongyu Apr 14, 2025

Choose a reason for hiding this comment

Uh oh!

kcalvinalvin Apr 21, 2025

Choose a reason for hiding this comment

Uh oh!

yyforyongyu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

starius left a comment

Choose a reason for hiding this comment

Uh oh!

guggero commented May 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kcalvinalvin commented May 6, 2025

Uh oh!

Roasbeef left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

coveralls commented Apr 14, 2025 •

edited

Loading

guggero commented May 6, 2025 •

edited

Loading