Dear LSST Science Pipelines Team,
I’m a member of the Wide Field Survey Telescope (WFST) data reduction pipeline software team. We are currently upgrading our obs_wfst
package from v24_0_0
to v29_2_0
. So far, we have successfully passed tests for calibration construction
and single-frame processing
.
However, we are encountering intermittent failures during the image subtraction step using lsst.ip.diffim.subtractImages.AlardLuptonSubtractTask
. Some tasks fail with the following error:
{
"name":"lsst.ctrl.mpexec.singleQuantumExecutor",
"asctime":"2025-09-25T02:29:46.123526Z",
"message":"Execution of task 'subtractImages' on quantum {instrument: 'WFC', detector: 18, visit: 118115, band: 'u', day_obs: 20250222, physical_filter: 'WFC-u'} failed. Exception OperationalError: (sqlite3.OperationalError) database is locked\n[SQL: BEGIN IMMEDIATE]\n(Background on this error at: https://sqlalche.me/e/20/e3q8)",
"levelno":40,
"levelname":"ERROR",
"filename":"singleQuantumExecutor.py",
"pathname":"/data/public/lsst_stack_v29_2_0/conda/envs/lsst-scipipe-10.1.0/share/eups/Linux64/ctrl_mpexec/ge10c2aeecd+d8b3cefe0c/python/lsst/ctrl/mpexec/singleQuantumExecutor.py",
"lineno":290,
"funcName":"_execute",
"process":2345353,
"processName":"task-{instrument: 'WFC', detector: 18, visit: 118115, band: 'u', day_obs: 20250222, physical_filter: 'WFC-u'}",
"MDC": {
"LABEL":"subtractImages:{instrument: 'WFC', detector: 18, visit: 118115, band: 'u', day_obs: 20250222, physical_filter: 'WFC-u'}",
"RUN":"WFC/runs/ap/20250925T021706Z"
}
}
The issue appears to be intermittent — we ran the subtraction step across 36 parallel processes three times:
- First run: all succeeded
- Second run: 3 out of 36 failed with the “database is locked” error
- Third run: all succeeded again
This suggests the failure is not deterministic and may be related to concurrent access or file locking.
We would like to ask:
- What is the recommended way to completely avoid this SQLite locking issue (i.e., achieve 100% success rate) in
v29_2_0
? - Could this type of error potentially occur in earlier pipeline stages such as
calibration construction
,single-frame processing
, orcoaddition
?
Thank you for your support and guidance.
Best regards,
Minxuan Cai
WFST Data Reduction Pipeline Team