-
Notifications
You must be signed in to change notification settings - Fork 314
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: Add error reporting for Retrieval #873
fix: Add error reporting for Retrieval #873
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great start! I think we can keep this PR's scope within retrieval first, then extend it to other tasks in a separate PR.
Could you see if you can get it running via command line, e.g. mteb -m intfloat/multilingual-e5-base -t kardosdrur/estonian-qa --export_errors
? That way:
a) we know that it works, and
b) the kwargs would be properly passed down. Feel free to start from mteb/cmd.py
Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>
@isaac-chung , Great! Happy to work on this and for other tasks. Does
I think kwargs like save_predictions are not supported in the cmd line, would we need a change like #877 |
My suggestion is to add stuff like the command line option in this PR. At the end of the day, we want to be sure that whatever changes made will work. So that could be an added test, or extending the needed components to support running it on the command line.
Oh my bad. Should have used the task name, not the dataset name. Try |
@isaac-chung Great thanks, I added kwargs option (works for save_predictions as well) in this PR and checked it works with |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for iterating! Just a few small comments. I'll also enable the tests now.
Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>
@isaac-chung thanks, changed kwargs. |
@jordiclive and @isaac-chung just letting you both know that I made an overhaul to the cli in #882. From my reading of this PR though they should still be compatible (I can just add the arguments into the mteb run parser). However, when reading through it does seem problematic to add a flag that only works for some of the tasks. Instead, I would add in the non-cli changes, making it possible to create a Python script for running it locally (or alternatively the new CLI construction can be extended in a python script):
In the future, we can then fully fledge out the flag such that errors are saved for all task types. Let me know what you think about this - the intend here is to make the CLI more transparent not to remove flexibility from the user. |
@KennethEnevoldsen ok so remove the cli? Just like save_predictions cannot be enabled through the cmd line and is a kwarg for Yes, I was planning to try and implement this feature for the other tasks. OK cool with your PR, good to neaten up cmd line and documentation—I think I tried the current main this morning and there were some issues with normal running and output folder. |
Yea remove it for now and then we can add it back into the CLI, when it is available for all tasks. If you experience any issues, please submit an issue and we will make sure to take a look at it. |
ok, done. Will do. |
@KennethEnevoldsen Yes that makes sense. I originally thought the CLI would have been a convenient avenue to test out the non-cli changes. should have stuck with the test suite 😅 thanks for taking a look! |
@jordiclive thanks for your contributions! I'll merge this in now. |
Adding Error reporting as discussed in #867.
(- Add cmd line kwargs) cmd line is under overhaul, will update if gets added to all tasks.