Bug in Place Model #187

rmayore · 2024-12-25T08:59:12Z

Describe the bug
https://github.com/medic/cht-pipeline/blob/main/models/contacts/place.sql

The accepted way to get the contact of a place, is couchdb.doc #>> '{contact,_id}', rather than couchdb.doc->>'contact_id'. You can refer to go to any cht couchdb medic database and look at a place document... or look at the legacy contactview_metadata views.

Similarly, the correct way to get the place of a non-place contact is couchdb.doc #>> '{parent,_id}' rather than couchdb.doc->>'place_id'. Still I'm not sure why a place table would have a place_id field.... presumably place.uuid is the id of the place so what exactly is place.place_id? Is it meant to be the parent place of the place?

Anyways, as a consequence, for cht pipelines that use this repo as a base the place table has null place_id and contact_id fields. I've confirmed this in both local setups and the prod KE MOH database that uses cht sync pipeline. With those fields being null, the extra overhead to join the raw couchdb table again just to get the contact of a place is massive.

To Reproduce

Step 1
Step 2

Expected behavior

Logs

Screenshots

Environment

Instance
Client platform: (eg: Windows, MacOS, Linux)

Additional context

The text was updated successfully, but these errors were encountered:

witash · 2025-01-13T13:41:47Z

@rmayore

for place_id some configurations have a separate field called place_id which is separate from the _id of the place document in couchdb. If present, it will be extracted from the document and indexed, but if absent, it will be NULL. This is fine, it is optional if the document _id is sufficient and prefferred to uniquely identify the place.

For contact_id, I'm not sure if there's any cht instance that has a separate contact_id field. But even if there is, I think you're right that doc->contact->>_id would generally be more useful.

we can maybe coalesce all the variations like in

cht-pipeline/models/forms/data_record.sql

Line 27 in 7bb002b

COALESCE(

in the meantime, joining back to the couchdb table shouldn't have massive overhead unless the result set is large. Do you have an example model that is not performant due to this issue?

rmayore added the Type: Bug Fix something that isn't working as intended label Dec 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug in Place Model #187

Bug in Place Model #187

rmayore commented Dec 25, 2024 •

edited

Loading

witash commented Jan 13, 2025

Bug in Place Model #187

Bug in Place Model #187

Comments

rmayore commented Dec 25, 2024 • edited Loading

witash commented Jan 13, 2025

rmayore commented Dec 25, 2024 •

edited

Loading