Save pass@k result & use custom tokenizer #20

Merged
merged 5 commits into bigcode-project:main on Jul 9, 2024

Conversation

marianna13 (Contributor)

Hey there!

Two changes that would be nice to have:

  1. Save the pass@k result after evaluation (in a JSON file).
  2. The ability to load a custom tokenizer with HF AutoTokenizer (i.e. if the tokenizer name is different from the model name). A rough sketch of both changes is below.
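A minimal sketch of the two requested changes, assuming an HF-style evaluation script; the helper names (load_tokenizer, save_pass_at_k), the output path, and the example score are illustrative, not the repository's actual identifiers:

```python
import json
import os
from typing import Optional

from transformers import AutoTokenizer


def load_tokenizer(model_name: str, tokenizer_name: Optional[str] = None):
    """Load the tokenizer for model_name, or a custom one if tokenizer_name is given."""
    return AutoTokenizer.from_pretrained(tokenizer_name or model_name, trust_remote_code=True)


def save_pass_at_k(pass_at_k: dict, pass_at_k_path: str) -> None:
    """Write the pass@k scores to a JSON file after evaluation."""
    os.makedirs(os.path.dirname(pass_at_k_path) or ".", exist_ok=True)
    with open(pass_at_k_path, "w") as f:
        json.dump(pass_at_k, f, indent=2)


# Example usage (tokenizer name differs from the model name, as tested later in this thread):
tokenizer = load_tokenizer("ibm-granite/granite-3b-code-base", "openai-community/gpt2")
save_pass_at_k({"pass@1": 0.5}, "results/pass_at_k.json")  # example score only
```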
@terryyz terryyz self-requested a review July 8, 2024 14:38
@@ -277,6 +277,12 @@ def stucking_checker():
if not os.path.isfile(result_path):
with open(result_path, "w") as f:
json.dump(results, f, indent=2)

terryyz (Collaborator)

Maybe adding if not os.path.isfile(pass_at_k_path):?
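For context, the suggested guard would mirror the existing check on result_path shown in the diff above, roughly like this (a sketch only; pass_at_k stands for whatever dict the PR writes to pass_at_k_path):

```python
# Skip writing if a pass@k file already exists, mirroring the result_path guard.
if not os.path.isfile(pass_at_k_path):
    with open(pass_at_k_path, "w") as f:
        json.dump(pass_at_k, f, indent=2)
```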

terryyz (Collaborator)

Actually, I think a better way is to check if at least the Pass@1 scores are the same and decide whether we need to rewrite the result_path and pass_at_k_path. Wdyt?
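One way such a check could look (a rough sketch, assuming pass_at_k holds the freshly computed scores and pass_at_k_path points to the previously saved file; the helper name is hypothetical):

```python
import json
import os


def should_rewrite(pass_at_k: dict, pass_at_k_path: str) -> bool:
    """Rewrite the result files only if no saved file exists or the saved pass@1 differs."""
    if not os.path.isfile(pass_at_k_path):
        return True
    with open(pass_at_k_path) as f:
        saved = json.load(f)
    return saved.get("pass@1") != pass_at_k.get("pass@1")
```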

marianna13 (Contributor, Author)

yes, sounds good

terryyz (Collaborator)

Thanks! I'll merge the PR after your update.

marianna13 (Contributor, Author)

Done

terryyz (Collaborator) commented Jul 9, 2024

@marianna13 LGTM, only minor updates are needed. Did you test these changes with some models?

marianna13 (Contributor, Author)

I tried ibm-granite/granite-3b-code-base with the openai-community/gpt2 tokenizer. I also tried apple/OpenELM-1_1B with the meta-llama/Llama-2-7b-hf tokenizer (that one didn't work, but not because of the tokenizer; there's a problem with the OpenELM config).

@terryyz terryyz merged commit bbe93d6 into bigcode-project:main Jul 9, 2024