I noticed two issues in the 201_8_SSSource_table notebook.
The text states “Check if any visits had more than one source detected” but
the following cell actually prints the number of sources (i.e. detections)
for a given ssObjectID.
The cross-match between the DiaSource and the SSSource tables is joined on
the ssObjectId colum, which leads to duplicated rows as a given ssObjectId
appears multiple times (the number of detections) in DiaSource. For example,
ssObjectId 20890932542191434 has three sources but appears nine times in the
cross-match. The resulting table is thus 142640 rows (sum of squared number
of sources per SSO) when it should be 5988 (sum of SSO sources). Joining on
diaSourceId instead of ssObjectId fixes this issue.
For #2 above, you are correct that the join statement used in the screenshot shown will return query results with all ssObjectId sources getting duplicated for each entry of that ssObjectId in the DiaSource table, which is not the desired result. We are implenting the necessary fix to change the JOIN statement here to read
“JOIN dp1.DiaSource AS dia ON sss.diaSourceId = dia.diaSourceId” in tutorial notebook 201_8_SSSource_table.ipynb as you correctly suggested will fix the issue. Thank you for bringing this to our attention.
For #1 above, there was a small typo in the text description for this cell. The text should say “objects” instead of “visits”. We have implemented a change so that the text now reads:
“Check if any SS objects had more than one SS source detected…”
This cell is, indeed, returning the SS objects that had more than one source detection. Running this cell returns a count of 417 Solar System objects with >1 detection out of the 431 Solar System objects included in the DP1 tables. This means that 14 Solar System objects had only 1 source detection.
Again, thank you for bringing these issues to our attention. We are in the process of implementing the changes and will report back when the new version of the notebook is available.