gqa fuse attention qkv #7890

FeixLiu · 2024-01-24T01:43:15Z

PR types

Others

PR changes

Others

Description

Support fused attention QKV projection for GQA (grouped-query attention) in the LLaMA model.

7B model, non_fuse vs fuse (figure not preserved)

3.5B model, 4mp vs 8mp (figure not preserved)
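
For readers unfamiliar with the idea, here is a minimal sketch of one way a fused QKV projection can be laid out under GQA. It is an illustration under stated assumptions, not the code from this PR: the function names, the per-KV-head grouping of `[q heads | k | v]`, and the example sizes (hidden size 4096, 32 query heads, 8 KV heads) are all hypothetical. It uses plain Python lists so that it runs without any framework.

```python
def fused_qkv_width(hidden_size, num_heads, num_kv_heads, mp_degree=1):
    """Output width of a fused QKV projection on one model-parallel rank.

    Assumed layout (hypothetical): each KV head owns a block holding its
    `groups` query heads followed by one key head and one value head.
    """
    if hidden_size % num_heads or num_heads % num_kv_heads or num_kv_heads % mp_degree:
        raise ValueError("sizes must divide evenly")
    head_dim = hidden_size // num_heads
    groups = num_heads // num_kv_heads          # query heads sharing one KV head
    kv_heads_local = num_kv_heads // mp_degree  # KV heads held by this rank
    return kv_heads_local * (groups + 2) * head_dim


def split_fused_qkv(mix, num_kv_heads, groups, head_dim):
    """Split one fused output vector (flat list) into per-head q, k, v lists."""
    block = (groups + 2) * head_dim
    if len(mix) != num_kv_heads * block:
        raise ValueError("unexpected fused width")
    q, k, v = [], [], []
    for i in range(num_kv_heads):
        chunk = mix[i * block:(i + 1) * block]
        # The first `groups` slices of the block are query heads.
        for j in range(groups):
            q.append(chunk[j * head_dim:(j + 1) * head_dim])
        # The last two slices are the shared key and value heads.
        k.append(chunk[groups * head_dim:(groups + 1) * head_dim])
        v.append(chunk[(groups + 1) * head_dim:])
    return q, k, v


# Hypothetical sizes: hidden 4096, 32 query heads, 8 KV heads.
full_width = fused_qkv_width(4096, 32, 8)        # whole fused projection
width_mp4 = fused_qkv_width(4096, 32, 8, 4)      # per rank with 4-way model parallel
width_mp8 = fused_qkv_width(4096, 32, 8, 8)      # per rank with 8-way model parallel

# Tiny example: 2 KV heads, 2 query heads per KV head, head_dim 1.
q, k, v = split_fused_qkv(list(range(8)), num_kv_heads=2, groups=2, head_dim=1)
```

With this grouping, sharding the fused output evenly across model-parallel ranks keeps each KV head together with the query heads that attend to it. The sketch assumes the number of KV heads is divisible by the model-parallel degree.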

paddle-bot · 2024-01-24T01:43:19Z

Thanks for your contribution!

codecov · 2024-01-24T02:19:34Z

Codecov Report

Attention: 7 lines in your changes are missing coverage. Please review.

Comparison is base (44bfeb0) 56.80% compared to head (5026243) 56.57%.
Report is 1 commit behind head on develop.

Files	Patch %	Lines
paddlenlp/transformers/llama/modeling.py	22.22%	7 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           develop    #7890      +/-   ##
===========================================
- Coverage    56.80%   56.57%   -0.23%
===========================================
  Files          588      589       +1
  Lines        89536    91330    +1794
===========================================
+ Hits         50858    51674     +816
- Misses       38678    39656     +978

paddlenlp/transformers/llama/modeling.py

ZHUI

LGTM

ZHUI reviewed Jan 29, 2024

paddlenlp/transformers/llama/modeling.py (two review comments, both resolved)

FeixLiu force-pushed the support_gqa_fusion branch from 7b04017 to 896c01f on January 29, 2024 23:15

FeixLiu added 2 commits January 31, 2024 17:09

gqa fuse attention qkv

65d1599

add annotation for the fusion

5026243

FeixLiu force-pushed the support_gqa_fusion branch from c79b7e2 to 5026243 on January 31, 2024 09:10

ZHUI approved these changes Jan 31, 2024

FeixLiu closed this Jan 31, 2024

FeixLiu reopened this Jan 31, 2024

wawltor merged commit c0c64fa into PaddlePaddle:develop Feb 1, 2024

FeixLiu deleted the support_gqa_fusion branch February 1, 2024 04:40
