Evaluation with multiple GPUs #31

Open
lucasjinreal opened this issue Mar 25, 2024 · 13 comments

Comments

@lucasjinreal

Currently, when using multiple GPUs for evaluation, the model gets loaded across several devices, which causes an inference error.

How can I force them to evaluate one task in parallel on multiple devices?

@kcz358
Collaborator

kcz358 commented Mar 25, 2024

Hi @lucasjinreal, you can refer to #12 for more details.

@lucasjinreal
Author

lucasjinreal commented Mar 25, 2024

@kcz358 Hi, the linked issue seems to be about running one model with tensor parallelism on 2 GPUs.

But my request is to run evaluation on 8 GPUs at the same time. My model is small, and I want all GPUs running simultaneously, then gather the results together. Is that supported for now? How do I enable it?

@kcz358
Collaborator

kcz358 commented Mar 25, 2024

It sounds like you should use accelerate launch --num_processes=8.
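
For context, launching several processes this way amounts to data-parallel evaluation: each process holds a full copy of the model, handles a slice of the samples, and the per-process results are merged afterwards. The following is a minimal, framework-free sketch of that idea, not lmms-eval's actual implementation; `shard_for_rank`, `merge_shards`, and `fake_eval` are hypothetical names used only for illustration.

```python
from typing import Dict, List, Sequence


def shard_for_rank(num_docs: int, rank: int, world_size: int) -> List[int]:
    """Return the document indices that one process (rank) should evaluate."""
    # Strided split: rank 0 takes 0, N, 2N, ...; rank 1 takes 1, N+1, ...
    return list(range(rank, num_docs, world_size))


def merge_shards(shards: Sequence[Dict[int, str]]) -> List[str]:
    """Combine per-rank {doc_index: prediction} dicts back into dataset order."""
    merged: Dict[int, str] = {}
    for shard in shards:
        merged.update(shard)
    return [merged[i] for i in sorted(merged)]


def fake_eval(doc: str) -> str:
    """Stand-in for a model call (hypothetical)."""
    return doc.upper()


if __name__ == "__main__":
    docs = ["a", "b", "c", "d", "e"]
    world_size = 2
    # In a real run each rank is a separate process; here they are simulated in a loop.
    per_rank = [
        {i: fake_eval(docs[i]) for i in shard_for_rank(len(docs), rank, world_size)}
        for rank in range(world_size)
    ]
    print(merge_shards(per_rank))
```

The strided split keeps the shards within one sample of each other in size, which matters when the dataset length is not a multiple of the process count.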

@lucasjinreal
Author

I am using main.py directly, since I need my customized LLaVA. Is there any other way that doesn't involve accelerate?

@kcz358
Collaborator

kcz358 commented Mar 26, 2024

If you do not want to use accelerate, you could also try torchrun. However, this is not tested, so you may need to figure out how to make it work yourself.
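
If you go the torchrun route, each worker process receives its position through environment variables (`RANK`, `LOCAL_RANK`, `WORLD_SIZE`), and the launched script has to read them itself. A rough sketch of that step follows; `read_dist_env` and the `DistInfo` container are hypothetical helpers, and the single-process defaults are an assumption so the script still runs without a launcher.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class DistInfo:
    rank: int
    local_rank: int
    world_size: int

    @property
    def device(self) -> str:
        # One GPU per process: the local rank selects the device on this node.
        return f"cuda:{self.local_rank}"


def read_dist_env(env: Optional[Mapping[str, str]] = None) -> DistInfo:
    """Read the launcher-provided variables, defaulting to a single process."""
    env = os.environ if env is None else env
    return DistInfo(
        rank=int(env.get("RANK", "0")),
        local_rank=int(env.get("LOCAL_RANK", "0")),
        world_size=int(env.get("WORLD_SIZE", "1")),
    )


if __name__ == "__main__":
    info = read_dist_env()
    print(info, info.device)
```

Placing the whole model on `info.device` in every process, rather than letting it spread across GPUs, is what keeps each worker self-contained.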

@lucasjinreal
Author

lucasjinreal commented Mar 26, 2024

Got it!

However, I ran into another problem: when evaluating mmbench, I got N/A results, while the results on cmmmu are normal:

num_fewshot: None,
batch_size: 1
|      Tasks       |Version|Filter|n-shot|  Metric  |Value|   |Stderr|
|------------------|-------|------|-----:|----------|-----|---|------|
|mmbench           |N/A    |none  |     0|submission|N/A  |±  |N/A   |
| - mmbench_cn_cc  |Yaml   |none  |     0|submission|N/A  |±  |N/A   |
| - mmbench_cn_dev |Yaml   |none  |     0|submission|N/A  |±  |N/A   |
| - mmbench_cn_test|Yaml   |none  |     0|submission|N/A  |±  |N/A   |
| - mmbench_en_dev |Yaml   |none  |     0|submission|N/A  |±  |N/A   |
| - mmbench_en_test|Yaml   |none  |     0|submission|N/A  |±  |N/A   |

|Groups |Version|Filter|n-shot|  Metric  |Value|   |Stderr|
|-------|-------|------|-----:|----------|-----|---|------|
|mmbench|N/A    |none  |     0|submission|N/A  |±  |N/A   |

Any help?

@kcz358
Collaborator

kcz358 commented Mar 26, 2024

Hi @lucasjinreal, you can see that the metric is submission, which means you need to submit the files to their website yourself to get the score.

You can find the submission file in your log file.

@lucasjinreal
Author

For mmbench, I looked at the result file. It has the prediction and the ground truth (gt), so why can't a score be calculated offline?

@Luodian
Contributor

Luodian commented Apr 2, 2024

> For mmbench, I looked at the result file. It has the prediction and the ground truth (gt), so why can't a score be calculated offline?

MMBench is designed for submission only; you need to submit the result file to their official site.

@lucasjinreal
Author

No, for dev it's certainly not.

I am not referring to test.

@Luodian
Contributor

Luodian commented Apr 2, 2024

> No, for dev it's certainly not.
>
> I am not referring to test.

Oh yes, you're right. We plan to update it to enable reporting the mmbench-dev score.
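
In the meantime, a dev-split score can be approximated by hand from a result file that holds both predictions and ground truth. The sketch below assumes each record has `prediction` and `answer` fields containing an option letter; those field names are assumptions for illustration and may not match the actual file, and the official MMBench scoring may apply additional rules, so treat the number as a rough check only.

```python
from typing import Iterable, Mapping


def accuracy(records: Iterable[Mapping[str, str]],
             pred_key: str = "prediction",
             gt_key: str = "answer") -> float:
    """Fraction of records whose predicted option matches the ground truth."""
    total = 0
    correct = 0
    for rec in records:
        total += 1
        # Compare case-insensitively and ignore surrounding whitespace.
        if rec[pred_key].strip().upper() == rec[gt_key].strip().upper():
            correct += 1
    return correct / total if total else 0.0


if __name__ == "__main__":
    # Hypothetical records standing in for rows loaded from the result file.
    sample = [
        {"prediction": "A", "answer": "A"},
        {"prediction": "b ", "answer": "B"},
        {"prediction": "C", "answer": "D"},
        {"prediction": "D", "answer": "D"},
    ]
    print(f"accuracy = {accuracy(sample):.2f}")
```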

@lucasjinreal
Author

Thanks, looking forward to it.

Luodian added a commit that referenced this issue Apr 4, 2024
* Remove unused code and configuration file

* Remove docvqa.yaml and update vizwizvqa.yaml

* lint

* Add dataset_kwargs to vizwizvqa.yaml

* Add dataset_kwargs to vizwizvqa.yaml

* textvqa (#27)

* Update textvqa.yaml and utils.py

* Fix YAML formatting in textvqa.yaml and remove unused files

* remove useless matric

* add textvqa val & test

* Update progress bar description in evaluator.py

* Update submission file names in VizWizVQA tasks

* Update output path to include log samples suffix

* Update submission file paths in OKVQA and VizWizVQA tasks

* Refactor llava-in-the-wild.yaml and utils.py

* Update metric for llava evaluation

* Refactor logging message in Task class

* Merge commit 'f80465fd0f30781c8c36b46c1d6d7bba751f9e33'

* Fix formatting issues and add progress bar closing statements

* Update task from "infovqa_val" to "infovqa_test" in infovqa_test.yaml

* Update tqdm progress bar in OtterHD model

* Squashed commit of the following:

commit af6c7a2b8c2959495dc351e6f6eb2a442efe4e94
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:46:19 2024 +0800

    Black lint

commit 26da729c40008f72ce3f10c932874f120f290e26
Merge: ab898e4 fb209e4
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:45:31 2024 +0800

    Merge branch 'main' into kc/list_tasks_num

commit acbb1a1997c5159709e3b81c3f0292b2f9def109
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:44:23 2024 +0800

    Enable list all tasks num

commit b33ac32f0ff28777204eaaf27a963200024081df
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:41:32 2024 +0800

    Exclude train yaml file in the task list

commit f80465f
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Sun Jan 28 02:04:57 2024 +0800

    Add InfoVQA, DocVQA, and QwenVL (#28)

    * add mmme

    * black

    * add model specific prompt and gen kwargs

    * black

    * add yaml config to supprot multi-model eval

    * print table at the end

    * refactor multi model code

    * add chartqa

    * black

    * add ai2d

    * black

    * update chartqa

    * blacl

    * update ai2d dataset

    * black

    * add qwenvl

    * add infovqa and docvqa

* Fix error handling in loading YAML config files

* Squashed commit of the following:

commit 2df0ce76ef836be1cb8ffbf3c854fe05563647b0
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 12:41:40 2024 +0800

    Fix key bugs

commit af6c7a2b8c2959495dc351e6f6eb2a442efe4e94
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:46:19 2024 +0800

    Black lint

commit 26da729c40008f72ce3f10c932874f120f290e26
Merge: ab898e4 fb209e4
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:45:31 2024 +0800

    Merge branch 'main' into kc/list_tasks_num

commit acbb1a1997c5159709e3b81c3f0292b2f9def109
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:44:23 2024 +0800

    Enable list all tasks num

commit b33ac32f0ff28777204eaaf27a963200024081df
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:41:32 2024 +0800

    Exclude train yaml file in the task list

commit f80465f
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Sun Jan 28 02:04:57 2024 +0800

    Add InfoVQA, DocVQA, and QwenVL (#28)

    * add mmme

    * black

    * add model specific prompt and gen kwargs

    * black

    * add yaml config to supprot multi-model eval

    * print table at the end

    * refactor multi model code

    * add chartqa

    * black

    * add ai2d

    * black

    * update chartqa

    * blacl

    * update ai2d dataset

    * black

    * add qwenvl

    * add infovqa and docvqa

* List task #num sorted

* Update prompt messages for image-related tasks

* Delete unused task configuration files

* Remove coco_train.yaml configuration file

* Update task name in mmmu.yaml

* Fix error message for missing tasks

* Add wandb import and integration

* Update generation kwargs for LMMS tasks

* Update lmms_eval MME task configuration and utils

* Update generation_kwargs in lmms_eval tasks

* Update doc_to_text function in coco and okvqa tasks

* Add COCO 2017 version

* Update task name in coco_test2017.yaml

* Squashed commit of the following:

commit 0390783
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Mon Jan 29 22:41:33 2024 +0800

    Add/mmmu test (#30)

    * mmmu_test

    * black

commit 407bc25
Author: Li Bo <drluodian@gmail.com>
Date:   Sun Jan 28 22:19:13 2024 +0800

    [Dataset Check] dataset check and add wandb logging (#29)

    * Remove unused code and configuration file

    * Remove docvqa.yaml and update vizwizvqa.yaml

    * lint

    * Add dataset_kwargs to vizwizvqa.yaml

    * Add dataset_kwargs to vizwizvqa.yaml

    * textvqa (#27)

    * Update textvqa.yaml and utils.py

    * Fix YAML formatting in textvqa.yaml and remove unused files

    * remove useless matric

    * add textvqa val & test

    * Update progress bar description in evaluator.py

    * Update submission file names in VizWizVQA tasks

    * Update output path to include log samples suffix

    * Update submission file paths in OKVQA and VizWizVQA tasks

    * Refactor llava-in-the-wild.yaml and utils.py

    * Update metric for llava evaluation

    * Refactor logging message in Task class

    * Merge commit 'f80465fd0f30781c8c36b46c1d6d7bba751f9e33'

    * Fix formatting issues and add progress bar closing statements

    * Update task from "infovqa_val" to "infovqa_test" in infovqa_test.yaml

    * Update tqdm progress bar in OtterHD model

    * Squashed commit of the following:

    commit af6c7a2b8c2959495dc351e6f6eb2a442efe4e94
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:46:19 2024 +0800

        Black lint

    commit 26da729c40008f72ce3f10c932874f120f290e26
    Merge: ab898e4 fb209e4
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:45:31 2024 +0800

        Merge branch 'main' into kc/list_tasks_num

    commit acbb1a1997c5159709e3b81c3f0292b2f9def109
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:44:23 2024 +0800

        Enable list all tasks num

    commit b33ac32f0ff28777204eaaf27a963200024081df
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:41:32 2024 +0800

        Exclude train yaml file in the task list

    commit f80465f
    Author: Zhang Peiyuan <a1286225768@gmail.com>
    Date:   Sun Jan 28 02:04:57 2024 +0800

        Add InfoVQA, DocVQA, and QwenVL (#28)

        * add mmme

        * black

        * add model specific prompt and gen kwargs

        * black

        * add yaml config to supprot multi-model eval

        * print table at the end

        * refactor multi model code

        * add chartqa

        * black

        * add ai2d

        * black

        * update chartqa

        * blacl

        * update ai2d dataset

        * black

        * add qwenvl

        * add infovqa and docvqa

    * Fix error handling in loading YAML config files

    * Squashed commit of the following:

    commit 2df0ce76ef836be1cb8ffbf3c854fe05563647b0
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 12:41:40 2024 +0800

        Fix key bugs

    commit af6c7a2b8c2959495dc351e6f6eb2a442efe4e94
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:46:19 2024 +0800

        Black lint

    commit 26da729c40008f72ce3f10c932874f120f290e26
    Merge: ab898e4 fb209e4
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:45:31 2024 +0800

        Merge branch 'main' into kc/list_tasks_num

    commit acbb1a1997c5159709e3b81c3f0292b2f9def109
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:44:23 2024 +0800

        Enable list all tasks num

    commit b33ac32f0ff28777204eaaf27a963200024081df
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:41:32 2024 +0800

        Exclude train yaml file in the task list

    commit f80465f
    Author: Zhang Peiyuan <a1286225768@gmail.com>
    Date:   Sun Jan 28 02:04:57 2024 +0800

        Add InfoVQA, DocVQA, and QwenVL (#28)

        * add mmme

        * black

        * add model specific prompt and gen kwargs

        * black

        * add yaml config to supprot multi-model eval

        * print table at the end

        * refactor multi model code

        * add chartqa

        * black

        * add ai2d

        * black

        * update chartqa

        * blacl

        * update ai2d dataset

        * black

        * add qwenvl

        * add infovqa and docvqa

    * List task #num sorted

    * Update prompt messages for image-related tasks

    * Delete unused task configuration files

    * Remove coco_train.yaml configuration file

    * Update task name in mmmu.yaml

    * Fix error message for missing tasks

    * Add wandb import and integration

    ---------

    Co-authored-by: Fanyi Pu <FPU001@e.ntu.edu.sg>
    Co-authored-by: kcz358 <92624596+kcz358@users.noreply.github.com>

* Refactor CLI evaluate function and improve error logging

---------

Co-authored-by: Fanyi Pu <FPU001@e.ntu.edu.sg>
Co-authored-by: kcz358 <92624596+kcz358@users.noreply.github.com>
Luodian added a commit that referenced this issue Apr 4, 2024
@hxhcreate

I ran into the same problem when evaluating with multiple GPUs. Looking forward to any assistance, thanks.

ERROR Error Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1! (when checking argument for argument weight in method wrapper_CUDA__cudnn_convolution) in generating
05-10 21:54:48 [lmms_eval/models/llava.py:407] ERROR Error Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:3! (when checking argument for argument weight in method wrapper_CUDA__cudnn_convolution) in generating
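
This message usually means that, inside a single process, some weights or inputs sit on a different GPU than the layer being called. One way to narrow it down is to list which devices the model's parameters live on before generation. The sketch below is a hypothetical helper that works on any objects exposing a `.device` attribute (for a PyTorch model, the assumption is that you would pass `model.parameters()`); it is a diagnostic aid, not a fix.

```python
from collections import Counter
from types import SimpleNamespace
from typing import Any, Dict, Iterable


def device_histogram(tensors: Iterable[Any]) -> Dict[str, int]:
    """Count how many tensors live on each device."""
    return dict(Counter(str(t.device) for t in tensors))


def is_single_device(tensors: Iterable[Any]) -> bool:
    """True when every tensor reports the same device (or there are none)."""
    return len(device_histogram(tensors)) <= 1


if __name__ == "__main__":
    # Stand-ins for parameters; with PyTorch, pass model.parameters() instead.
    params = [
        SimpleNamespace(device="cuda:0"),
        SimpleNamespace(device="cuda:0"),
        SimpleNamespace(device="cuda:1"),
    ]
    print(device_histogram(params))
    print(is_single_device(params))
```

If the histogram shows more than one device within one process, the model was split across GPUs at load time, which is a different setup from running one full copy per process.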

kangreen0210 pushed a commit to kangreen0210/LIME that referenced this issue Oct 6, 2024
* Remove unused code and configuration file

* Remove docvqa.yaml and update vizwizvqa.yaml

* lint

* Add dataset_kwargs to vizwizvqa.yaml

* Add dataset_kwargs to vizwizvqa.yaml

* textvqa (EvolvingLMMs-Lab#27)

* Update textvqa.yaml and utils.py

* Fix YAML formatting in textvqa.yaml and remove unused files

* remove useless matric

* add textvqa val & test

* Update progress bar description in evaluator.py

* Update submission file names in VizWizVQA tasks

* Update output path to include log samples suffix

* Update submission file paths in OKVQA and VizWizVQA tasks

* Refactor llava-in-the-wild.yaml and utils.py

* Update metric for llava evaluation

* Refactor logging message in Task class

* Merge commit 'ad8d9da1fb40c446202bf9b0095b02262df2ffc8'

* Fix formatting issues and add progress bar closing statements

* Update task from "infovqa_val" to "infovqa_test" in infovqa_test.yaml

* Update tqdm progress bar in OtterHD model

* Squashed commit of the following:

commit c09b621195878300417315a97efdec25e67dd7f5
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:46:19 2024 +0800

    Black lint

commit 864a1aba26388276b7e57717b89520fcc77b3f62
Merge: ab898e4 ad8d9da
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:45:31 2024 +0800

    Merge branch 'main' into kc/list_tasks_num

commit ab898e4fd30bf83888125d48b80bc86b01cb5d39
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:44:23 2024 +0800

    Enable list all tasks num

commit c0ea54d49cb65b747d7e8fccac75838acabe05db
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:41:32 2024 +0800

    Exclude train yaml file in the task list

commit ad8d9da
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Sun Jan 28 02:04:57 2024 +0800

    Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

    * add mmme

    * black

    * add model specific prompt and gen kwargs

    * black

    * add yaml config to supprot multi-model eval

    * print table at the end

    * refactor multi model code

    * add chartqa

    * black

    * add ai2d

    * black

    * update chartqa

    * blacl

    * update ai2d dataset

    * black

    * add qwenvl

    * add infovqa and docvqa

* Fix error handling in loading YAML config files

* Squashed commit of the following:

commit dbba2fe6447b0dfd4bb89a368f62178f2b253006
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 12:41:40 2024 +0800

    Fix key bugs

commit c09b621195878300417315a97efdec25e67dd7f5
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:46:19 2024 +0800

    Black lint

commit 864a1aba26388276b7e57717b89520fcc77b3f62
Merge: ab898e4 ad8d9da
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:45:31 2024 +0800

    Merge branch 'main' into kc/list_tasks_num

commit ab898e4fd30bf83888125d48b80bc86b01cb5d39
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:44:23 2024 +0800

    Enable list all tasks num

commit c0ea54d49cb65b747d7e8fccac75838acabe05db
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:41:32 2024 +0800

    Exclude train yaml file in the task list

commit ad8d9da
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Sun Jan 28 02:04:57 2024 +0800

    Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

    * add mmme

    * black

    * add model specific prompt and gen kwargs

    * black

    * add yaml config to supprot multi-model eval

    * print table at the end

    * refactor multi model code

    * add chartqa

    * black

    * add ai2d

    * black

    * update chartqa

    * blacl

    * update ai2d dataset

    * black

    * add qwenvl

    * add infovqa and docvqa

* List task #num sorted

* Update prompt messages for image-related tasks

* Delete unused task configuration files

* Remove coco_train.yaml configuration file

* Update task name in mmmu.yaml

* Fix error message for missing tasks

* Add wandb import and integration

* Update generation kwargs for LMMS tasks

* Update lmms_eval MME task configuration and utils

* Update generation_kwargs in lmms_eval tasks

* Update doc_to_text function in coco and okvqa tasks

* Add COCO 2017 version

* Update task name in coco_test2017.yaml

* Squashed commit of the following:

commit 6ee856b
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Mon Jan 29 22:41:33 2024 +0800

    Add/mmmu test (EvolvingLMMs-Lab#30)

    * mmmu_test

    * black

commit 4a1183c
Author: Li Bo <drluodian@gmail.com>
Date:   Sun Jan 28 22:19:13 2024 +0800

    [Dataset Check] dataset check and add wandb logging (EvolvingLMMs-Lab#29)

    * Remove unused code and configuration file

    * Remove docvqa.yaml and update vizwizvqa.yaml

    * lint

    * Add dataset_kwargs to vizwizvqa.yaml

    * Add dataset_kwargs to vizwizvqa.yaml

    * textvqa (EvolvingLMMs-Lab#27)

    * Update textvqa.yaml and utils.py

    * Fix YAML formatting in textvqa.yaml and remove unused files

    * remove useless matric

    * add textvqa val & test

    * Update progress bar description in evaluator.py

    * Update submission file names in VizWizVQA tasks

    * Update output path to include log samples suffix

    * Update submission file paths in OKVQA and VizWizVQA tasks

    * Refactor llava-in-the-wild.yaml and utils.py

    * Update metric for llava evaluation

    * Refactor logging message in Task class

    * Merge commit 'ad8d9da1fb40c446202bf9b0095b02262df2ffc8'

    * Fix formatting issues and add progress bar closing statements

    * Update task from "infovqa_val" to "infovqa_test" in infovqa_test.yaml

    * Update tqdm progress bar in OtterHD model

    * Squashed commit of the following:

    commit c09b621195878300417315a97efdec25e67dd7f5
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:46:19 2024 +0800

        Black lint

    commit 864a1aba26388276b7e57717b89520fcc77b3f62
    Merge: ab898e4 ad8d9da
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:45:31 2024 +0800

        Merge branch 'main' into kc/list_tasks_num

    commit ab898e4fd30bf83888125d48b80bc86b01cb5d39
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:44:23 2024 +0800

        Enable list all tasks num

    commit c0ea54d49cb65b747d7e8fccac75838acabe05db
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:41:32 2024 +0800

        Exclude train yaml file in the task list

    commit ad8d9da
    Author: Zhang Peiyuan <a1286225768@gmail.com>
    Date:   Sun Jan 28 02:04:57 2024 +0800

        Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

        * add mmme

        * black

        * add model specific prompt and gen kwargs

        * black

        * add yaml config to supprot multi-model eval

        * print table at the end

        * refactor multi model code

        * add chartqa

        * black

        * add ai2d

        * black

        * update chartqa

        * blacl

        * update ai2d dataset

        * black

        * add qwenvl

        * add infovqa and docvqa

    * Fix error handling in loading YAML config files

    * Squashed commit of the following:

    commit dbba2fe6447b0dfd4bb89a368f62178f2b253006
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 12:41:40 2024 +0800

        Fix key bugs

    commit c09b621195878300417315a97efdec25e67dd7f5
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:46:19 2024 +0800

        Black lint

    commit 864a1aba26388276b7e57717b89520fcc77b3f62
    Merge: ab898e4 ad8d9da
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:45:31 2024 +0800

        Merge branch 'main' into kc/list_tasks_num

    commit ab898e4fd30bf83888125d48b80bc86b01cb5d39
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:44:23 2024 +0800

        Enable list all tasks num

    commit c0ea54d49cb65b747d7e8fccac75838acabe05db
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:41:32 2024 +0800

        Exclude train yaml file in the task list

    commit ad8d9da
    Author: Zhang Peiyuan <a1286225768@gmail.com>
    Date:   Sun Jan 28 02:04:57 2024 +0800

        Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

        * add mmme

        * black

        * add model specific prompt and gen kwargs

        * black

        * add yaml config to support multi-model eval

        * print table at the end

        * refactor multi model code

        * add chartqa

        * black

        * add ai2d

        * black

        * update chartqa

        * black

        * update ai2d dataset

        * black

        * add qwenvl

        * add infovqa and docvqa

    * List task #num sorted

    * Update prompt messages for image-related tasks

    * Delete unused task configuration files

    * Remove coco_train.yaml configuration file

    * Update task name in mmmu.yaml

    * Fix error message for missing tasks

    * Add wandb import and integration

    ---------

    Co-authored-by: Fanyi Pu <FPU001@e.ntu.edu.sg>
    Co-authored-by: kcz358 <92624596+kcz358@users.noreply.github.com>

* Refactor CLI evaluate function and improve error logging

---------

Co-authored-by: Fanyi Pu <FPU001@e.ntu.edu.sg>
Co-authored-by: kcz358 <92624596+kcz358@users.noreply.github.com>
kangreen0210 pushed a commit to kangreen0210/LIME that referenced this issue Oct 6, 2024
* Remove unused code and configuration file

* Remove docvqa.yaml and update vizwizvqa.yaml

* lint

* Add dataset_kwargs to vizwizvqa.yaml

* Add dataset_kwargs to vizwizvqa.yaml

* textvqa (EvolvingLMMs-Lab#27)

* Update textvqa.yaml and utils.py

* Fix YAML formatting in textvqa.yaml and remove unused files

* remove useless metric

* add textvqa val & test

* Update progress bar description in evaluator.py

* Update submission file names in VizWizVQA tasks

* Update output path to include log samples suffix

* Update submission file paths in OKVQA and VizWizVQA tasks

* Refactor llava-in-the-wild.yaml and utils.py

* Update metric for llava evaluation

* Refactor logging message in Task class

* Merge commit '0dc9a47afe9a61214f11053dae5641716052f30f'

* Fix formatting issues and add progress bar closing statements

* Update task from "infovqa_val" to "infovqa_test" in infovqa_test.yaml

* Update tqdm progress bar in OtterHD model

* Squashed commit of the following:

commit eae210c3700a59b7d5cc9de46fcb855f443096aa
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:46:19 2024 +0800

    Black lint

commit 18e4a19e82357352ab25df77b5ae4f1b011d61ae
Merge: ab898e4 fb209e4
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:45:31 2024 +0800

    Merge branch 'main' into kc/list_tasks_num

commit e899be48f55f95172fdf96bd2a98d3b91ff2aaed
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:44:23 2024 +0800

    Enable list all tasks num

commit a999fc6889c6986c28ec5d95460a4ab5233e5d4f
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:41:32 2024 +0800

    Exclude train yaml file in the task list

commit 0dc9a47
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Sun Jan 28 02:04:57 2024 +0800

    Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

    * add mmme

    * black

    * add model specific prompt and gen kwargs

    * black

    * add yaml config to support multi-model eval

    * print table at the end

    * refactor multi model code

    * add chartqa

    * black

    * add ai2d

    * black

    * update chartqa

    * black

    * update ai2d dataset

    * black

    * add qwenvl

    * add infovqa and docvqa

* Fix error handling in loading YAML config files

* Squashed commit of the following:

commit fdb0c6785b0c5d6979d10e7ddf75ce9055038db8
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 12:41:40 2024 +0800

    Fix key bugs

commit eae210c3700a59b7d5cc9de46fcb855f443096aa
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:46:19 2024 +0800

    Black lint

commit 18e4a19e82357352ab25df77b5ae4f1b011d61ae
Merge: ab898e4 fb209e4
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:45:31 2024 +0800

    Merge branch 'main' into kc/list_tasks_num

commit e899be48f55f95172fdf96bd2a98d3b91ff2aaed
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:44:23 2024 +0800

    Enable list all tasks num

commit a999fc6889c6986c28ec5d95460a4ab5233e5d4f
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:41:32 2024 +0800

    Exclude train yaml file in the task list

commit 0dc9a47
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Sun Jan 28 02:04:57 2024 +0800

    Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

    * add mmme

    * black

    * add model specific prompt and gen kwargs

    * black

    * add yaml config to support multi-model eval

    * print table at the end

    * refactor multi model code

    * add chartqa

    * black

    * add ai2d

    * black

    * update chartqa

    * black

    * update ai2d dataset

    * black

    * add qwenvl

    * add infovqa and docvqa

* List task #num sorted

* Update prompt messages for image-related tasks

* Delete unused task configuration files

* Remove coco_train.yaml configuration file

* Update task name in mmmu.yaml

* Fix error message for missing tasks

* Add wandb import and integration

* Update generation kwargs for LMMS tasks

* Update lmms_eval MME task configuration and utils

* Update generation_kwargs in lmms_eval tasks

* Update doc_to_text function in coco and okvqa tasks

* Add COCO 2017 version

* Update task name in coco_test2017.yaml

* Squashed commit of the following:

commit 9d06741
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Mon Jan 29 22:41:33 2024 +0800

    Add/mmmu test (EvolvingLMMs-Lab#30)

    * mmmu_test

    * black

commit 0d4e69f
Author: Li Bo <drluodian@gmail.com>
Date:   Sun Jan 28 22:19:13 2024 +0800

    [Dataset Check] dataset check and add wandb logging (EvolvingLMMs-Lab#29)

    * Remove unused code and configuration file

    * Remove docvqa.yaml and update vizwizvqa.yaml

    * lint

    * Add dataset_kwargs to vizwizvqa.yaml

    * Add dataset_kwargs to vizwizvqa.yaml

    * textvqa (EvolvingLMMs-Lab#27)

    * Update textvqa.yaml and utils.py

    * Fix YAML formatting in textvqa.yaml and remove unused files

    * remove useless metric

    * add textvqa val & test

    * Update progress bar description in evaluator.py

    * Update submission file names in VizWizVQA tasks

    * Update output path to include log samples suffix

    * Update submission file paths in OKVQA and VizWizVQA tasks

    * Refactor llava-in-the-wild.yaml and utils.py

    * Update metric for llava evaluation

    * Refactor logging message in Task class

    * Merge commit '0dc9a47afe9a61214f11053dae5641716052f30f'

    * Fix formatting issues and add progress bar closing statements

    * Update task from "infovqa_val" to "infovqa_test" in infovqa_test.yaml

    * Update tqdm progress bar in OtterHD model

    * Squashed commit of the following:

    commit eae210c3700a59b7d5cc9de46fcb855f443096aa
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:46:19 2024 +0800

        Black lint

    commit 18e4a19e82357352ab25df77b5ae4f1b011d61ae
    Merge: ab898e4 fb209e4
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:45:31 2024 +0800

        Merge branch 'main' into kc/list_tasks_num

    commit e899be48f55f95172fdf96bd2a98d3b91ff2aaed
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:44:23 2024 +0800

        Enable list all tasks num

    commit a999fc6889c6986c28ec5d95460a4ab5233e5d4f
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:41:32 2024 +0800

        Exclude train yaml file in the task list

    commit 0dc9a47
    Author: Zhang Peiyuan <a1286225768@gmail.com>
    Date:   Sun Jan 28 02:04:57 2024 +0800

        Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

        * add mmme

        * black

        * add model specific prompt and gen kwargs

        * black

        * add yaml config to support multi-model eval

        * print table at the end

        * refactor multi model code

        * add chartqa

        * black

        * add ai2d

        * black

        * update chartqa

        * black

        * update ai2d dataset

        * black

        * add qwenvl

        * add infovqa and docvqa

    * Fix error handling in loading YAML config files

    * Squashed commit of the following:

    commit fdb0c6785b0c5d6979d10e7ddf75ce9055038db8
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 12:41:40 2024 +0800

        Fix key bugs

    commit eae210c3700a59b7d5cc9de46fcb855f443096aa
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:46:19 2024 +0800

        Black lint

    commit 18e4a19e82357352ab25df77b5ae4f1b011d61ae
    Merge: ab898e4 fb209e4
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:45:31 2024 +0800

        Merge branch 'main' into kc/list_tasks_num

    commit e899be48f55f95172fdf96bd2a98d3b91ff2aaed
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:44:23 2024 +0800

        Enable list all tasks num

    commit a999fc6889c6986c28ec5d95460a4ab5233e5d4f
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:41:32 2024 +0800

        Exclude train yaml file in the task list

    commit 0dc9a47
    Author: Zhang Peiyuan <a1286225768@gmail.com>
    Date:   Sun Jan 28 02:04:57 2024 +0800

        Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

        * add mmme

        * black

        * add model specific prompt and gen kwargs

        * black

        * add yaml config to support multi-model eval

        * print table at the end

        * refactor multi model code

        * add chartqa

        * black

        * add ai2d

        * black

        * update chartqa

        * black

        * update ai2d dataset

        * black

        * add qwenvl

        * add infovqa and docvqa

    * List task #num sorted

    * Update prompt messages for image-related tasks

    * Delete unused task configuration files

    * Remove coco_train.yaml configuration file

    * Update task name in mmmu.yaml

    * Fix error message for missing tasks

    * Add wandb import and integration

    ---------

    Co-authored-by: Fanyi Pu <FPU001@e.ntu.edu.sg>
    Co-authored-by: kcz358 <92624596+kcz358@users.noreply.github.com>

* Refactor CLI evaluate function and improve error logging

---------

Co-authored-by: Fanyi Pu <FPU001@e.ntu.edu.sg>
Co-authored-by: kcz358 <92624596+kcz358@users.noreply.github.com>
kangreen0210 pushed a commit to kangreen0210/LIME that referenced this issue Oct 6, 2024
* Remove unused code and configuration file

* Remove docvqa.yaml and update vizwizvqa.yaml

* lint

* Add dataset_kwargs to vizwizvqa.yaml

* Add dataset_kwargs to vizwizvqa.yaml

* textvqa (EvolvingLMMs-Lab#27)

* Update textvqa.yaml and utils.py

* Fix YAML formatting in textvqa.yaml and remove unused files

* remove useless metric

* add textvqa val & test

* Update progress bar description in evaluator.py

* Update submission file names in VizWizVQA tasks

* Update output path to include log samples suffix

* Update submission file paths in OKVQA and VizWizVQA tasks

* Refactor llava-in-the-wild.yaml and utils.py

* Update metric for llava evaluation

* Refactor logging message in Task class

* Merge commit 'e811faca3743a9b0c865144145198cc5eea21393'

* Fix formatting issues and add progress bar closing statements

* Update task from "infovqa_val" to "infovqa_test" in infovqa_test.yaml

* Update tqdm progress bar in OtterHD model

* Squashed commit of the following:

commit 290c53c0ea60868d2f0fb31bee1ac8d213b08d36
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:46:19 2024 +0800

    Black lint

commit 27bc5c84f9d9f2ff56b2adfa69d23894f4027100
Merge: ab898e4 fb209e4
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:45:31 2024 +0800

    Merge branch 'main' into kc/list_tasks_num

commit 09d42b879158738f5484f31d514c6b400a418551
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:44:23 2024 +0800

    Enable list all tasks num

commit e8110aacf87bb0450db298b0993164765e0a624f
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:41:32 2024 +0800

    Exclude train yaml file in the task list

commit e811fac
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Sun Jan 28 02:04:57 2024 +0800

    Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

    * add mmme

    * black

    * add model specific prompt and gen kwargs

    * black

    * add yaml config to support multi-model eval

    * print table at the end

    * refactor multi model code

    * add chartqa

    * black

    * add ai2d

    * black

    * update chartqa

    * black

    * update ai2d dataset

    * black

    * add qwenvl

    * add infovqa and docvqa

* Fix error handling in loading YAML config files

* Squashed commit of the following:

commit 15f168756d8f92f53dea87548efe606d0d1401b5
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 12:41:40 2024 +0800

    Fix key bugs

commit 290c53c0ea60868d2f0fb31bee1ac8d213b08d36
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:46:19 2024 +0800

    Black lint

commit 27bc5c84f9d9f2ff56b2adfa69d23894f4027100
Merge: ab898e4 fb209e4
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:45:31 2024 +0800

    Merge branch 'main' into kc/list_tasks_num

commit 09d42b879158738f5484f31d514c6b400a418551
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:44:23 2024 +0800

    Enable list all tasks num

commit e8110aacf87bb0450db298b0993164765e0a624f
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:41:32 2024 +0800

    Exclude train yaml file in the task list

commit e811fac
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Sun Jan 28 02:04:57 2024 +0800

    Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

    * add mmme

    * black

    * add model specific prompt and gen kwargs

    * black

    * add yaml config to support multi-model eval

    * print table at the end

    * refactor multi model code

    * add chartqa

    * black

    * add ai2d

    * black

    * update chartqa

    * black

    * update ai2d dataset

    * black

    * add qwenvl

    * add infovqa and docvqa

* List task #num sorted

* Update prompt messages for image-related tasks

* Delete unused task configuration files

* Remove coco_train.yaml configuration file

* Update task name in mmmu.yaml

* Fix error message for missing tasks

* Add wandb import and integration

* Update generation kwargs for LMMS tasks

* Update lmms_eval MME task configuration and utils

* Update generation_kwargs in lmms_eval tasks

* Update doc_to_text function in coco and okvqa tasks

* Add COCO 2017 version

* Update task name in coco_test2017.yaml

* Squashed commit of the following:

commit da7a8df
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Mon Jan 29 22:41:33 2024 +0800

    Add/mmmu test (EvolvingLMMs-Lab#30)

    * mmmu_test

    * black

commit a853223
Author: Li Bo <drluodian@gmail.com>
Date:   Sun Jan 28 22:19:13 2024 +0800

    [Dataset Check] dataset check and add wandb logging (EvolvingLMMs-Lab#29)

    * Remove unused code and configuration file

    * Remove docvqa.yaml and update vizwizvqa.yaml

    * lint

    * Add dataset_kwargs to vizwizvqa.yaml

    * Add dataset_kwargs to vizwizvqa.yaml

    * textvqa (EvolvingLMMs-Lab#27)

    * Update textvqa.yaml and utils.py

    * Fix YAML formatting in textvqa.yaml and remove unused files

    * remove useless metric

    * add textvqa val & test

    * Update progress bar description in evaluator.py

    * Update submission file names in VizWizVQA tasks

    * Update output path to include log samples suffix

    * Update submission file paths in OKVQA and VizWizVQA tasks

    * Refactor llava-in-the-wild.yaml and utils.py

    * Update metric for llava evaluation

    * Refactor logging message in Task class

    * Merge commit 'e811faca3743a9b0c865144145198cc5eea21393'

    * Fix formatting issues and add progress bar closing statements

    * Update task from "infovqa_val" to "infovqa_test" in infovqa_test.yaml

    * Update tqdm progress bar in OtterHD model

    * Squashed commit of the following:

    commit 290c53c0ea60868d2f0fb31bee1ac8d213b08d36
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:46:19 2024 +0800

        Black lint

    commit 27bc5c84f9d9f2ff56b2adfa69d23894f4027100
    Merge: ab898e4 fb209e4
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:45:31 2024 +0800

        Merge branch 'main' into kc/list_tasks_num

    commit 09d42b879158738f5484f31d514c6b400a418551
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:44:23 2024 +0800

        Enable list all tasks num

    commit e8110aacf87bb0450db298b0993164765e0a624f
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:41:32 2024 +0800

        Exclude train yaml file in the task list

    commit e811fac
    Author: Zhang Peiyuan <a1286225768@gmail.com>
    Date:   Sun Jan 28 02:04:57 2024 +0800

        Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

        * add mmme

        * black

        * add model specific prompt and gen kwargs

        * black

        * add yaml config to support multi-model eval

        * print table at the end

        * refactor multi model code

        * add chartqa

        * black

        * add ai2d

        * black

        * update chartqa

        * black

        * update ai2d dataset

        * black

        * add qwenvl

        * add infovqa and docvqa

    * Fix error handling in loading YAML config files

    * Squashed commit of the following:

    commit 15f168756d8f92f53dea87548efe606d0d1401b5
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 12:41:40 2024 +0800

        Fix key bugs

    commit 290c53c0ea60868d2f0fb31bee1ac8d213b08d36
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:46:19 2024 +0800

        Black lint

    commit 27bc5c84f9d9f2ff56b2adfa69d23894f4027100
    Merge: ab898e4 fb209e4
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:45:31 2024 +0800

        Merge branch 'main' into kc/list_tasks_num

    commit 09d42b879158738f5484f31d514c6b400a418551
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:44:23 2024 +0800

        Enable list all tasks num

    commit e8110aacf87bb0450db298b0993164765e0a624f
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:41:32 2024 +0800

        Exclude train yaml file in the task list

    commit e811fac
    Author: Zhang Peiyuan <a1286225768@gmail.com>
    Date:   Sun Jan 28 02:04:57 2024 +0800

        Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

        * add mmme

        * black

        * add model specific prompt and gen kwargs

        * black

        * add yaml config to support multi-model eval

        * print table at the end

        * refactor multi model code

        * add chartqa

        * black

        * add ai2d

        * black

        * update chartqa

        * black

        * update ai2d dataset

        * black

        * add qwenvl

        * add infovqa and docvqa

    * List task #num sorted

    * Update prompt messages for image-related tasks

    * Delete unused task configuration files

    * Remove coco_train.yaml configuration file

    * Update task name in mmmu.yaml

    * Fix error message for missing tasks

    * Add wandb import and integration

    ---------

    Co-authored-by: Fanyi Pu <FPU001@e.ntu.edu.sg>
    Co-authored-by: kcz358 <92624596+kcz358@users.noreply.github.com>

* Refactor CLI evaluate function and improve error logging

---------

Co-authored-by: Fanyi Pu <FPU001@e.ntu.edu.sg>
Co-authored-by: kcz358 <92624596+kcz358@users.noreply.github.com>
kangreen0210 pushed a commit to kangreen0210/LIME that referenced this issue Oct 6, 2024
* Remove unused code and configuration file

* Remove docvqa.yaml and update vizwizvqa.yaml

* lint

* Add dataset_kwargs to vizwizvqa.yaml

* Add dataset_kwargs to vizwizvqa.yaml

* textvqa (EvolvingLMMs-Lab#27)

* Update textvqa.yaml and utils.py

* Fix YAML formatting in textvqa.yaml and remove unused files

* remove useless metric

* add textvqa val & test

* Update progress bar description in evaluator.py

* Update submission file names in VizWizVQA tasks

* Update output path to include log samples suffix

* Update submission file paths in OKVQA and VizWizVQA tasks

* Refactor llava-in-the-wild.yaml and utils.py

* Update metric for llava evaluation

* Refactor logging message in Task class

* Merge commit '5553d106e5ffd84b280b3d5a3c8d47c35e2d310b'

* Fix formatting issues and add progress bar closing statements

* Update task from "infovqa_val" to "infovqa_test" in infovqa_test.yaml

* Update tqdm progress bar in OtterHD model

* Squashed commit of the following:

commit eae210c3700a59b7d5cc9de46fcb855f443096aa
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:46:19 2024 +0800

    Black lint

commit 18e4a19e82357352ab25df77b5ae4f1b011d61ae
Merge: ab898e4 fb209e4
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:45:31 2024 +0800

    Merge branch 'main' into kc/list_tasks_num

commit e899be48f55f95172fdf96bd2a98d3b91ff2aaed
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:44:23 2024 +0800

    Enable list all tasks num

commit a999fc6889c6986c28ec5d95460a4ab5233e5d4f
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:41:32 2024 +0800

    Exclude train yaml file in the task list

commit 5553d10
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Sun Jan 28 02:04:57 2024 +0800

    Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

    * add mmme

    * black

    * add model specific prompt and gen kwargs

    * black

    * add yaml config to support multi-model eval

    * print table at the end

    * refactor multi model code

    * add chartqa

    * black

    * add ai2d

    * black

    * update chartqa

    * black

    * update ai2d dataset

    * black

    * add qwenvl

    * add infovqa and docvqa

* Fix error handling in loading YAML config files

* Squashed commit of the following:

commit fdb0c6785b0c5d6979d10e7ddf75ce9055038db8
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 12:41:40 2024 +0800

    Fix key bugs

commit eae210c3700a59b7d5cc9de46fcb855f443096aa
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:46:19 2024 +0800

    Black lint

commit 18e4a19e82357352ab25df77b5ae4f1b011d61ae
Merge: ab898e4 fb209e4
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:45:31 2024 +0800

    Merge branch 'main' into kc/list_tasks_num

commit e899be48f55f95172fdf96bd2a98d3b91ff2aaed
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:44:23 2024 +0800

    Enable list all tasks num

commit a999fc6889c6986c28ec5d95460a4ab5233e5d4f
Author: kcz358 <92624596+kcz358@users.noreply.github.com>
Date:   Sun Jan 28 09:41:32 2024 +0800

    Exclude train yaml file in the task list

commit 5553d10
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Sun Jan 28 02:04:57 2024 +0800

    Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

    * add mmme

    * black

    * add model specific prompt and gen kwargs

    * black

    * add yaml config to support multi-model eval

    * print table at the end

    * refactor multi model code

    * add chartqa

    * black

    * add ai2d

    * black

    * update chartqa

    * black

    * update ai2d dataset

    * black

    * add qwenvl

    * add infovqa and docvqa

* List task #num sorted

* Update prompt messages for image-related tasks

* Delete unused task configuration files

* Remove coco_train.yaml configuration file

* Update task name in mmmu.yaml

* Fix error message for missing tasks

* Add wandb import and integration

* Update generation kwargs for LMMS tasks

* Update lmms_eval MME task configuration and utils

* Update generation_kwargs in lmms_eval tasks

* Update doc_to_text function in coco and okvqa tasks

* Add COCO 2017 version

* Update task name in coco_test2017.yaml

* Squashed commit of the following:

commit 0fd4558
Author: Zhang Peiyuan <a1286225768@gmail.com>
Date:   Mon Jan 29 22:41:33 2024 +0800

    Add/mmmu test (EvolvingLMMs-Lab#30)

    * mmmu_test

    * black

commit f125889
Author: Li Bo <drluodian@gmail.com>
Date:   Sun Jan 28 22:19:13 2024 +0800

    [Dataset Check] dataset check and add wandb logging (EvolvingLMMs-Lab#29)

    * Remove unused code and configuration file

    * Remove docvqa.yaml and update vizwizvqa.yaml

    * lint

    * Add dataset_kwargs to vizwizvqa.yaml

    * Add dataset_kwargs to vizwizvqa.yaml

    * textvqa (EvolvingLMMs-Lab#27)

    * Update textvqa.yaml and utils.py

    * Fix YAML formatting in textvqa.yaml and remove unused files

    * remove useless metric

    * add textvqa val & test

    * Update progress bar description in evaluator.py

    * Update submission file names in VizWizVQA tasks

    * Update output path to include log samples suffix

    * Update submission file paths in OKVQA and VizWizVQA tasks

    * Refactor llava-in-the-wild.yaml and utils.py

    * Update metric for llava evaluation

    * Refactor logging message in Task class

    * Merge commit '5553d106e5ffd84b280b3d5a3c8d47c35e2d310b'

    * Fix formatting issues and add progress bar closing statements

    * Update task from "infovqa_val" to "infovqa_test" in infovqa_test.yaml

    * Update tqdm progress bar in OtterHD model

    * Squashed commit of the following:

    commit eae210c3700a59b7d5cc9de46fcb855f443096aa
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:46:19 2024 +0800

        Black lint

    commit 18e4a19e82357352ab25df77b5ae4f1b011d61ae
    Merge: ab898e4 fb209e4
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:45:31 2024 +0800

        Merge branch 'main' into kc/list_tasks_num

    commit e899be48f55f95172fdf96bd2a98d3b91ff2aaed
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:44:23 2024 +0800

        Enable list all tasks num

    commit a999fc6889c6986c28ec5d95460a4ab5233e5d4f
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:41:32 2024 +0800

        Exclude train yaml file in the task list

    commit 5553d10
    Author: Zhang Peiyuan <a1286225768@gmail.com>
    Date:   Sun Jan 28 02:04:57 2024 +0800

        Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

        * add mmme

        * black

        * add model specific prompt and gen kwargs

        * black

        * add yaml config to support multi-model eval

        * print table at the end

        * refactor multi model code

        * add chartqa

        * black

        * add ai2d

        * black

        * update chartqa

        * black

        * update ai2d dataset

        * black

        * add qwenvl

        * add infovqa and docvqa

    * Fix error handling in loading YAML config files

    * Squashed commit of the following:

    commit fdb0c6785b0c5d6979d10e7ddf75ce9055038db8
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 12:41:40 2024 +0800

        Fix key bugs

    commit eae210c3700a59b7d5cc9de46fcb855f443096aa
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:46:19 2024 +0800

        Black lint

    commit 18e4a19e82357352ab25df77b5ae4f1b011d61ae
    Merge: ab898e4 fb209e4
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:45:31 2024 +0800

        Merge branch 'main' into kc/list_tasks_num

    commit e899be48f55f95172fdf96bd2a98d3b91ff2aaed
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:44:23 2024 +0800

        Enable list all tasks num

    commit a999fc6889c6986c28ec5d95460a4ab5233e5d4f
    Author: kcz358 <92624596+kcz358@users.noreply.github.com>
    Date:   Sun Jan 28 09:41:32 2024 +0800

        Exclude train yaml file in the task list

    commit 5553d10
    Author: Zhang Peiyuan <a1286225768@gmail.com>
    Date:   Sun Jan 28 02:04:57 2024 +0800

        Add InfoVQA, DocVQA, and QwenVL (EvolvingLMMs-Lab#28)

        * add mmme

        * black

        * add model specific prompt and gen kwargs

        * black

        * add yaml config to support multi-model eval

        * print table at the end

        * refactor multi model code

        * add chartqa

        * black

        * add ai2d

        * black

        * update chartqa

        * black

        * update ai2d dataset

        * black

        * add qwenvl

        * add infovqa and docvqa

    * List task #num sorted

    * Update prompt messages for image-related tasks

    * Delete unused task configuration files

    * Remove coco_train.yaml configuration file

    * Update task name in mmmu.yaml

    * Fix error message for missing tasks

    * Add wandb import and integration

    ---------

    Co-authored-by: Fanyi Pu <FPU001@e.ntu.edu.sg>
    Co-authored-by: kcz358 <92624596+kcz358@users.noreply.github.com>

* Refactor CLI evaluate function and improve error logging

---------

Co-authored-by: Fanyi Pu <FPU001@e.ntu.edu.sg>
Co-authored-by: kcz358 <92624596+kcz358@users.noreply.github.com>