

@szeyu commented on Aug 15, 2024

Benchmark

Allows users to benchmark model(s) on different backends on their own hardware. It analyses the token in / token out throughput for you in a statistical manner.
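As a rough sketch of the kind of per-run throughput statistics involved (the variable names and numbers below are illustrative assumptions, not output of ellm_benchmark.py):

import statistics

# Hypothetical per-run measurements: (generated_tokens, seconds_elapsed)
runs = [(512, 14.2), (512, 13.8), (512, 15.1)]

# Token-out throughput (tokens per second) for each run
throughputs = [tokens / seconds for tokens, seconds in runs]

print("mean tok/s :", round(statistics.mean(throughputs), 2))
print("stdev tok/s:", round(statistics.stdev(throughputs), 2))
print("min / max  :", round(min(throughputs), 2), "/", round(max(throughputs), 2))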

Benchmark a Model

To benchmark a model, run the command below with the following arguments:

  • --backend cpu | ipex | openvino | directml
  • --model_name Name of the Model
  • --model_path Path to Model | Model Repo ID
  • --token_in Number of Input Tokens (Max 2048)
  • --token_out Number of Output Tokens
python ellm_benchmark.py --backend <cpu | ipex | openvino | directml> --model_name <Name of the Model> --model_path <Path to Model | Model Repo ID> --token_in <Number of Input Tokens (Max 2048)> --token_out <Number of Output Tokens>
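For example, a hypothetical run on the OpenVINO backend (the model name and path below are placeholders, not values from this PR):

python ellm_benchmark.py --backend openvino --model_name my-llm --model_path ./models/my-llm --token_in 1024 --token_out 512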

Loop to Benchmark the Models

Customise your benchmarking config

# Define the models
model_names = [
    # model names
]

# Define the model paths (in the same order as model_names); a local path or a model repo ID
model_paths = [
    # path to model / model repo id
]

# Define the (input, output) token lengths to sweep
token_in_out = [
    (1024, 1024), (1024, 512), (1024, 256), (1024, 128),
    (512, 1024), (512, 512), (512, 256), (512, 128),
    (256, 1024), (256, 512), (256, 256), (256, 128),
    (128, 1024), (128, 512), (128, 256), (128, 128),
]

# Choose a backend (keep one, comment out the rest)
backend = "cpu"
# backend = "directml"
# backend = "ipex"
# backend = "openvino"

# Number of loops per configuration
loop_count = 20
python loop_ellm_benchmark.py
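For reference, a minimal sketch of what such a loop driver could do, assuming it simply re-runs ellm_benchmark.py for every model and token combination in the config above (this is an illustration, not the actual contents of loop_ellm_benchmark.py):

import subprocess

# Assumes model_names, model_paths, token_in_out, backend and loop_count
# are defined as in the config block above
for model_name, model_path in zip(model_names, model_paths):
    for token_in, token_out in token_in_out:
        for _ in range(loop_count):
            subprocess.run([
                "python", "ellm_benchmark.py",
                "--backend", backend,
                "--model_name", model_name,
                "--model_path", model_path,
                "--token_in", str(token_in),
                "--token_out", str(token_out),
            ], check=True)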

Generate a Report (XLSX) of a Model's Benchmark

To generate a report for a model, run the command below with the following argument:

  • --model_name Name of the Model
python analyse_detailed_benchmark.py --model_name <Name of the Model>
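For example (the model name is a placeholder):

python analyse_detailed_benchmark.py --model_name my-llm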

Generate Reports (XLSX) of Multiple Models' Benchmarks

List the models that you want benchmark reports for:

model_names = [
    # model names
]
python loop_analyse_detailed_benchmark.py
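As with the loop benchmark, this presumably iterates over model_names and calls the single-model report script once per entry; a minimal sketch under that assumption (the model name is a placeholder, not the actual contents of loop_analyse_detailed_benchmark.py):

import subprocess

model_names = [
    "my-llm",  # placeholder model name
]

# Generate one XLSX report per model
for model_name in model_names:
    subprocess.run(
        ["python", "analyse_detailed_benchmark.py", "--model_name", model_name],
        check=True,
    )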
