Model name: t5_local

About T5

T5 is a text-to-text transformer model that converts input text to output text. It's based on the Transformer architecture where an encoder processes the input and a decoder then generates the output. It's pre-trained on a large corpus of text data and is capable of performing a wide range of natural language processing tasks, such as chatbots, translation, summarization, and question answering.

Read more about T5 on Wikipedia.

Supported aidb operations

  • encode_text
  • decode_text
  • encode_text_batch
  • decode_text_batch

Supported models

  • t5-small (default)

See more in the Support matrix.

Creating the default model

SELECT aidb.create_model('my_t5_model', 't5_local');

Creating a specific model

SELECT aidb.create_model( 'another_t5_model', 't5_local', '{"model": "t5-large", "revision": "main"}'::JSONB )

Model configuration settings

The following configuration settings are available for T5 models:

  • model — The T5 model to use. The default is t5-small.
  • revision — The revision of the model to use. The default is refs/pr/15.
  • temperature — The temperature to use for sampling. The default is 0.0.
  • seed — The random seed to use for sampling. The default is 1599222198345926291.
  • max_tokens — The maximum number of tokens to generate. The default is 16384.
  • repeat_penalty — The repetition penalty to use. The default is 1.1.
  • repeat_last_n — The number of tokens to consider for the repetition penalty. The default is 64.

Model credentials

No credentials are required for the T5 model.