fix sequence length in santacoder and introduce new model type
#23
by mayank-mishra - opened
Adds a new model_type to the config. Currently this is gpt2
which creates problems with huggingface/optimum.
Fix sequence length bug which is not seen in transformers but in ONNX because transformers's generate method passes position_ids itself whereas when running with ONNX, the model needs to infer it itself.
mayank-mishra changed pull request status to open