Could you release the evaluation scripts with the Vicuna model early? #1

Open · KerolosAtef opened this issue on Dec 5, 2023 · 6 comments
@KerolosAtef

No description provided.

@avinash31d

+1

@shehanmunasinghe (Collaborator)

@KerolosAtef @avinash31d, thank you for your interest in our work. Please find the details about the Vicuna-based quantitative evaluation benchmark here: https://github.com/mbzuai-oryx/Video-LLaVA/tree/main/quantitative_evaluation.
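
For context, benchmarks of this style score each (question, ground-truth answer, predicted answer) triple with an LLM judge. Below is a minimal sketch of such a judge call; the model ID, prompt wording, and output format are illustrative assumptions, not the repository's exact setup:

```python
# Minimal sketch of a Vicuna-as-judge scoring step for video QA evaluation.
# Assumptions (not from the repository): the model ID, the prompt wording,
# and the "yes/no + 0-5 score" output format.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "lmsys/vicuna-7b-v1.5"  # illustrative judge checkpoint

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

def judge(question: str, answer: str, prediction: str) -> str:
    # Hypothetical judge prompt: ask the LLM whether the prediction matches
    # the ground-truth answer and to give an integer score from 0 to 5.
    prompt = (
        "Evaluate whether the predicted answer matches the correct answer.\n"
        f"Question: {question}\n"
        f"Correct answer: {answer}\n"
        f"Predicted answer: {prediction}\n"
        "Reply with 'yes' or 'no' followed by an integer score from 0 to 5."
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=32, do_sample=False)
    # Decode only the newly generated tokens, skipping the prompt.
    return tokenizer.decode(
        out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
```

The judge's "yes/no" verdicts are then aggregated into the accuracy numbers reported on the benchmarks.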

@KerolosAtef (Author)

Thank you very much. However, the Vicuna model does not produce the same results on each run.

I have tried to reproduce some of the Video-ChatGPT results, and these are the numbers I get:
ActivityNet: Acc 36.13 instead of 40.8
TGIF: Acc 63.07 instead of 66.5

@shehanmunasinghe (Collaborator)

@KerolosAtef We attribute this to the randomness introduced by the temperature parameter in both the tested model and the LLM used for evaluation. This will be addressed in our future work.
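
For anyone trying to tighten reproducibility on the evaluation side, the run-to-run variance from sampling can be removed with greedy decoding, or bounded by fixing the RNG seed. A minimal sketch with Hugging Face transformers; the model ID and generation settings are illustrative, not the repository's actual configuration:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, set_seed

MODEL_ID = "lmsys/vicuna-7b-v1.5"  # illustrative checkpoint
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")
inputs = tokenizer("Example evaluation prompt", return_tensors="pt").to(model.device)

# Option 1: greedy decoding - deterministic; temperature has no effect.
out = model.generate(**inputs, max_new_tokens=64, do_sample=False)

# Option 2: keep sampling but seed all RNGs so repeated runs match.
set_seed(42)  # seeds Python, NumPy, and torch generators
out = model.generate(**inputs, max_new_tokens=64, do_sample=True, temperature=0.7)
```

Greedy decoding trades some answer diversity for exact repeatability, which is usually an acceptable trade-off for a judge model.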

@KerolosAtef (Author)

Okay, good.
I want to confirm one thing: for the zero-shot datasets (MSVD, MSR-VTT, ActivityNet, TGIF), did you use the test data or the validation data?

@shehanmunasinghe (Collaborator)

We follow the same approach as Video-ChatGPT, i.e., using the test splits.
