Added ROUGE-L metric for evaluation by YourSaDady · Pull Request #500 · OptimalScale/LMFlow

YourSaDady · 2023-06-18T03:55:36Z

Added a new evaluation metric called ROUGE-L (https://github.com/yizhongw/self-instruct).
To apply: run_evaluation_with_rougel.sh
Test case: rougel_test_case.sh

Added ROUGE-L evaluation.

A script similar to run_evaluation_with_lora.sh

.

changed line 473, added choices of "rl", "rouge-l", and "ROUGE-L" to apply the ROUGE-L metric.

model: from facebook/galactica-1.3b to gpt2_large

.

...

Combine similar accuracy and ROUGE-L metrics into one metric

Similar to evaluate.py

YourSaDady added 30 commits April 30, 2023 10:44

Added ROUGE-L evaluation.

290be18

Added ROUGE-L evaluation.

Evaluation with ROUGE-L

56ae992

A script similar to run_evaluation_with_lora.sh

typo fixed

2aa8e02

.

Appended choices lists with ROUGE-L funtions

3e321b2

changed line 473, added choices of "rl", "rouge-l", and "ROUGE-L" to apply the ROUGE-L metric.

changed the model_path and deleted lora_model_path

61b38a5

model: from facebook/galactica-1.3b to gpt2_large

imported deepspeed

6bc78ca

.

typo fixed

3a5c0fc

...

Add files via upload

c88e18f

Simplify the evaluator.py

ee17f1a

Combine similar accuracy and ROUGE-L metrics into one metric

for testing

5971fa7

the original evaluator file before June 4th

1ab1afe

the original version before June 4th

64c3acf

Newest version of the team

dcac238

Update args.py

bdcc9f8

Update run_evaluation_with_rougel.sh

5b6fb41

a test case for ROUGE-L metric in evaluation

545c808

test case for ROUGE-L

72500db

update

5851f47

test ROUGE-L metric

9c5fd69

Added the function of evaluate.py

4861bcc

updated

f0304f5

Add files via upload

7fc95e7

import evaluate, not AutoPipeline

dd5005e

import Test_rougel

621059a

Add files via upload

f74cb75

import Test_rougel

b1a615d

Add inference_batch_size_per_device

19fd37e

test for the validity of ROUGE-L metric

ff58720

split test.py and test_rougel.py

f46e99b

deleted the part of test.py

c3f09e6

YourSaDady added 13 commits June 8, 2023 14:49

Added inference_batch_size_per_device

782ebcf

copied the function of Auto_pipeline inside

91e61cf

Update test.py

4c51c35

tiny updates

73c1d2a

fix issues of JSON dataset format

ddc294d

fix the Pool issue

f088ab6

fix Pool issue

b6f703f

fix weird precision error in comparison of scores

70cb3c6

similar to evaluate.py

0a29167

Similar to evaluator.py

727c82d

Update test.py

48a392b

Similar to evaluate.py

Update test_rougel.py

eefad15

Update rougel_test_case.sh

b5abd56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added ROUGE-L metric for evaluation#500

Added ROUGE-L metric for evaluation#500
YourSaDady wants to merge 43 commits intoOptimalScale:mainfrom
YourSaDady:main

YourSaDady commented Jun 18, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

YourSaDady commented Jun 18, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant