Add neighborhood csv submission instructions #801
I might be missing something. I have a few questions and a suggestion:
- The generate_neighborhood_json.py script seems to expect some image-related columns, but they are not in the README. I wonder if those columns are added by fetch_images.py in the processing pipeline and are thus not required in the CSV. Is this the case?
- The generate_neighborhood_json.py script removes x, y, lat, lon from the column list. The README documents lat and lon but not x and y. I presume they are added by add_zcta_centroids.py. I'd like to verify whether this understanding is correct.
- The generate_neighborhood_json.py script casts zipcode to string, but the README has this field as Number. I wonder if this is because a Boston zip code can look like 02108, so it needs to be str so that the leading 0 doesn't get lost. If so, we might want to change the data type of zipcode from Number to Text in the README.
- Suggestion: the README.md at the root of the repo has a great section for Data. I'd like to suggest we add a sentence there pointing to this new README text file as the reference for the CSV schema. What do you think?
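To illustrate the leading-zero concern in the zipcode question, here is a minimal sketch using only the Python standard library. The sample CSV, its column names, and the read_zipcodes helper are hypothetical and are not taken from generate_neighborhood_json.py; the sketch only shows why a numeric type would drop the leading 0 while a text type keeps it.

```python
import csv
import io

# Hypothetical two-row CSV; the column names are illustrative only,
# not the actual neighborhood data template.
SAMPLE_CSV = "town,zipcode\nBoston,02108\nCambridge,02139\n"


def read_zipcodes(text, as_number):
    """Return the zipcode column, parsed as int or kept as text."""
    rows = csv.DictReader(io.StringIO(text))
    return [int(row["zipcode"]) if as_number else row["zipcode"] for row in rows]


as_numbers = read_zipcodes(SAMPLE_CSV, as_number=True)
as_text = read_zipcodes(SAMPLE_CSV, as_number=False)

print(as_numbers)  # [2108, 2139] -> the leading 0 is gone
print(as_text)     # ['02108', '02139'] -> the leading 0 is preserved
```

Treating the field as text end to end, in both the README schema and the script, would avoid this kind of silent change.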
Thanks for the review, @aaronxsu! Answering your questions below:
Yes, those extra image metadata columns are generated in
Yup similar to the above,
That's a good point, number felt intuitive for the BHA-side to interpret but we do want to preserve those starting
That makes sense! Just added that context in 25b8e3f, let me know what you think!
aaronxsu left a comment
Thanks for the replies to my questions and for the changes. They make sense to me.
This PR looks good and is good to go!
25b8e3f to 3acd3af
Thanks for the review!
Overview
Adds a README to the neighborhood_data dir with more specific neighborhood data template instructions. Intended as a reference for us as well as the client when sending over the CSV template, to prevent loss of context between data updates.
See #788 (comment) for background.
Checklist
- fixup! commits have been squashed
- CHANGELOG.md updated with summary of features or fixes, following Keep a Changelog guidelines
- README.md updated if necessary to reflect the changes
- ./scripts/format to lint, format, and fix the application source code
Testing Instructions
- neighborhood_data/generate_neighborhood_json script to confirm parity
Partially resolves #788