Fix training_data bug in test_ingest_with_training_source_uri_tdb and validate training data dimensions in ingest() #175
What
When I created training_data in the new test_ingest_with_training_source_uri_tdb() test, I accidentally sliced the original data incorrectly: it had already been transposed, so I couldn't just slice it directly. That led to us computing a 2x2 centroid instead of a 4x2 centroid, as you can see here:
[image: test output showing the 2x2 centroids]
This PR fixes this issue with training_data and also adds a check to the code so that we raise an exception if this happens. Here is the same test now working correctly (look for centroids at the bottom):
[image: test output showing the 4x2 centroids]
Testing
Note
I also added a few checks on the inputs to ingest(); they seem safe and potentially useful.
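For reviewers who want to see the slicing pitfall in isolation, here is a minimal NumPy sketch. It is not the code in this PR: the array sizes, the (dimensions, num_vectors) layout, and the helper validate_training_dimensions() are all assumptions made for illustration, and the real check in ingest() may look different.

```python
import numpy as np


def validate_training_dimensions(training_data, expected_dimensions):
    """Hypothetical check (not the actual ingest() code).

    Assumes training data is laid out as (dimensions, num_vectors) and
    raises ValueError when the first axis does not match.
    """
    if training_data.ndim != 2:
        raise ValueError(f"expected a 2-D array, got {training_data.ndim}-D")
    if training_data.shape[0] != expected_dimensions:
        raise ValueError(
            f"training data has {training_data.shape[0]} dimensions, "
            f"expected {expected_dimensions}"
        )


# Illustrative data: 10 vectors with 4 dimensions each, stored one per row.
data = np.arange(40, dtype=np.float32).reshape(10, 4)

# After transposing, axis 0 is the dimension and axis 1 is the vector index.
transposed = data.T  # shape (4, 10)

# Slicing the first axis now drops dimensions instead of selecting vectors.
wrong_slice = transposed[:2]  # shape (2, 10): only 2 of the 4 dimensions

# Selecting the first 5 vectors means slicing the second axis.
right_slice = transposed[:, :5]  # shape (4, 5)

validate_training_dimensions(right_slice, expected_dimensions=4)  # passes

try:
    validate_training_dimensions(wrong_slice, expected_dimensions=4)
except ValueError as err:
    print(f"rejected: {err}")
```

With a check like this in place, training data with the wrong dimensionality fails loudly instead of silently producing centroids of the wrong shape.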