[LLM] Add Yuan model #8654

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged

wawltor merged 20 commits into PaddlePaddle:develop from zhaogf01:add-yuan-model

Jul 11, 2024

Contributor

zhaogf01 commented Jun 25, 2024

PR types

New features

PR changes

Models

Description

添加了源2.0的模型结构、配置等相关文件

zhaogf01 added 7 commits

June 24, 2024 17:37

add yuan model

55e0c75

add yuan model settings

04aaaa0

fix conflict

9bad9a8

add readme

0f34595

update format

1a152c9

update readme

d9e2a4a

update readme

7e80c59

paddle-bot bot commented Jun 25, 2024

Thanks for your contribution!

paddle-bot bot added the contributor label

paddle-bot bot assigned DesmonDay

Collaborator

DrownFish19 commented Jun 25, 2024

Lint问题可以参考link进行修复

zhaogf01 added 3 commits

June 25, 2024 16:31

update for lint

c7577c4

update for lint

056cdcb

update for lint

52a9556

Contributor Author

zhaogf01 commented Jun 25, 2024

我看lint的日志中引起black、isort、copyright_checker failed的文件已经被修改了，请问我还需要修改吗？或者这个错误具体指什么？我没看到引起错误的源文件？
另外，test中的错误需要处理吗？请问这个错误具体是指哪个文件？我没太看懂。

Collaborator

DrownFish19 commented Jun 25, 2024 •

edited

Loading

我看lint的日志中引起black、isort、copyright_checker failed的文件已经被修改了，请问我还需要修改吗？或者这个错误具体指什么？我没看到引起错误的源文件？

具体是格式错误问题，PR中的文件需要满足要求格式。可以本地使用pip install pre-commit && pre-commit install并使用pre-commit run --file XXX.py格式化本地代码并上传。

另外，test中的错误需要处理吗？请问这个错误具体是指哪个文件？我没太看懂。

辛苦拉一下最近代码即可，最新commit已经修复。

update for lint

afe3f4e

codecov bot commented Jun 26, 2024 •

edited

Loading

Codecov Report

Attention: Patch coverage is 14.11043% with 560 lines in your changes missing coverage. Please review.

Project coverage is 55.42%. Comparing base (6d464bf) to head (3677479).
Report is 222 commits behind head on develop.

Files with missing lines	Patch %	Lines
paddlenlp/transformers/yuan/modeling.py	13.37%	544 Missing ⚠️
paddlenlp/transformers/yuan/configuration.py	23.80%	16 Missing ⚠️

Additional details and impacted files

@@ Coverage Diff @@ ## develop #8654 +/- ## =========================================== - Coverage 55.74% 55.42% -0.32%  =========================================== Files 623 626 +3 Lines 97456 98057 +601 =========================================== + Hits 54323 54351 +28  - Misses 43133 43706 +573

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

add fp16

8b7878b

Contributor Author

zhaogf01 commented Jun 27, 2024

请问目前的三个failed应该如何修改？

NINGBENZHE reviewed

View reviewed changes

paddlenlp/transformers/yuan/utils/tensor_parallelism_tools.py

      # See the License for the specific language governing permissions and  
    # limitations under the License.  
    
    """ Yuan model tools"""

Contributor

NINGBENZHE Jun 28, 2024

这串代码建议封装成函数，避免出现2，24这样的魔鬼数字，明文模型路径等；
建议搞成参数传入；

Contributor Author

zhaogf01 Jul 1, 2024

已修改。请问codecov中的warning应该如何修改？

Contributor

NINGBENZHE Jul 1, 2024

这个应该不影响代码合入，找commiter帮忙合入就可以了

Contributor Author

zhaogf01 Jul 1, 2024

@DesmonDay 如果没有什么问题，麻烦帮忙合入，谢谢

update utils

aaf1fb7

DrownFish19 reviewed

View reviewed changes

.pre-commit-config.yaml Outdated

      repos:  
    # For Python files  
    - repo: https://github.com/psf/black.git  
    - repo: https://gitee.com/wygfzren/black.git  
 

Collaborator

DrownFish19 Jul 1, 2024

此文件请修改回原版本，会影响其他贡献者对代码进行格式化

Contributor Author

zhaogf01 Jul 4, 2024

已修改

paddlenlp/transformers/__init__.py Outdated

      from .deberta_v2.configuration import *  
    from .qwen2 import *  
    from .qwen2_moe import *  
    from .yuan.modeling import *  
 

Collaborator

DrownFish19 Jul 1, 2024

请使用 from .yuan import * 进行导入，并在yuan文件夹下的__init__.py中import 导入modeling和configuration

Contributor Author

zhaogf01 Jul 4, 2024

已修改

paddlenlp/transformers/yuan/modeling.py Outdated

      from paddle.distributed import fleet  
    from paddle.nn import CrossEntropyLoss  
    
    from paddlenlp.transformers.conversion_utils import (

Collaborator

DrownFish19 Jul 1, 2024

paddlenlp库内部函数和类，推荐使用相对路径import

Contributor Author

zhaogf01 Jul 4, 2024

已修改

paddlenlp/transformers/yuan/modeling.py Outdated

       return q_embed, k_embed  
    
    class YuanPreTrainedModel(PretrainedModel):

Collaborator

DrownFish19 Jul 1, 2024

此处需要修改模型名称为YuanPretrainedModel，否则在auto import时会产生报错。当前PaddleNLP导入规则为模型自定义名称（如Qwen2， Yuan） + 固定类型（如PretrainedModel, ForCausalLM）

Contributor Author

zhaogf01 Jul 4, 2024

已修改

paddlenlp/transformers/yuan/modeling.py Outdated

       model_mappings.extend(layer_mappings)  
    
     init_name_mappings(mappings=model_mappings)  
     # base-model prefix "LlamaModel"

Collaborator

DrownFish19 Jul 1, 2024

这一行修改或者删除

Contributor Author

zhaogf01 Jul 4, 2024

已修改

paddlenlp/transformers/yuan/modeling.py Outdated

       if module._padding_idx is not None:  
     module.weight.data[module._padding_idx].zero_()  
    
     def _set_gradient_checkpointing(self, module, value=False):

Collaborator

DrownFish19 Jul 1, 2024

PaddleNLP中使用recompute来控制重计算，可参考llama相关重计算设置

Contributor Author

zhaogf01 Jul 4, 2024

已修改

paddlenlp/transformers/yuan/modeling.py

      
     hidden_states = inputs_embeds  
    
     if self.gradient_checkpointing and self.training:

Collaborator

DrownFish19 Jul 1, 2024

此处判断参数应为recompute，实现细节可参考llama代码

Contributor Author

zhaogf01 Jul 4, 2024

已修改

DrownFish19 changed the title ~~Add yuan model~~ [LLM] Add Yuan model

zhaogf01 and others added 5 commits

July 4, 2024 06:10

fix bug

f2b0f3c

Merge branch 'PaddlePaddle:develop' into add-yuan-model

8fa2cd7

format

42f87f2

correct pre-commit

e409997

correct fp16

f5e6a57

Contributor Author

zhaogf01 commented Jul 9, 2024

请问，test测试不通过应该如何修改？
其次，请问其他修改是否合格？

Collaborator

DrownFish19 commented Jul 10, 2024

请问，test测试不通过应该如何修改？其次，请问其他修改是否合格？

非常感谢您的贡献。

Test Cases出现问题的原因是Yuan modeling中import einops导致，考虑动转静等后续推理流程，建议修改einops.arrange为paddle.reshape操作，而不是在setup.py中引入einops。
paddlenlp/transformers/yuan/utils/tensor_parallelism_tools.py并非必须文件，如果存在完整模型参数，paddlenlp在模型加载过程中会自动切分参数实现模型并行（tensor parallel）。
修改后Test和CI通过即可合入。

delete rearrange

75c8acb

Contributor Author

zhaogf01 commented Jul 10, 2024

请问，test测试不通过应该如何修改？其次，请问其他修改是否合格？

非常感谢您的贡献。

Test Cases出现问题的原因是Yuan modeling中import einops导致，考虑动转静等后续推理流程，建议修改einops.arrange为paddle.reshape操作，而不是在setup.py中引入einops。

paddlenlp/transformers/yuan/utils/tensor_parallelism_tools.py并非必须文件，如果存在完整模型参数，paddlenlp在模型加载过程中会自动切分参数实现模型并行（tensor parallel）。

修改后Test和CI通过即可合入。

1、已修改
2、此处是需要处理的，原因在于源2.0的attention中的reshape将q和k混在了一起。paddlenlp在模型加载过程中是会自动切分参数，但由于上述源2.0结构的特殊，导致在gather的时候q_state和k_state混在一起。这个工具的作用就是提前将权重重组，可以避免上述问题。
谢谢！

DrownFish19 previously approved these changes

View reviewed changes

Collaborator

DrownFish19 left a comment

LGTM

add pre_train

3677479

zhaogf01 dismissed DrownFish19’s stale review via 3677479

July 10, 2024 09:16

PaddlePaddle locked and limited conversation to collaborators

PaddlePaddle unlocked this conversation

DrownFish19 approved these changes

View reviewed changes

wawltor merged commit 1af227a into PaddlePaddle:develop

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment