Hire a web Developer and Designer to upgrade and boost your online presence with cutting edge Technologies

Friday, 2 June 2023

Beginners Guide to Flan-T5

 Flan-T5 is an enhanced version of Google’s T5 AI model which is quite good at certain language tasks.

For example, it’s supposed to be better at a lot of zero-shot examples even than GPT-3.

Install and Setup Flan-T5

First, we install the transformers module by running the below command
pip install transformers

If you run into an issue, check out the Transformers Installation Guide.
Then, we load the Flan-T5 model
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-small")
Load the tokenizer
tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-small")
Note we loaded the small model.  You can load any of the versions listed in a later section below.
The bigger the model, the better it is but also more resource consuming

Using Flan-T5 for language AI tasks

Next, we pass the prompt we want the AI model to generate text for.
inputs = tokenizer("A intro paragraph on a article on space travel:", return_tensors="pt")
We call the model’s generate function and get the response.
outputs = model.generate(**inputs)
You can then print the output from the AI model
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))

Flan-T5 versions

ModelGin File LocationCheckpoint Location
Flan-T5 Smallt5_1_1/small.gings://t5-data/pretrained_models/t5x/flan_t5_small/checkpoint_1198000
Flan-T5 Baset5_1_1/base.gings://t5-data/pretrained_models/t5x/flan_t5_base/checkpoint_1184000
Flan-T5 Larget5_1_1_large.gings://t5-data/pretrained_models/t5x/flan_t5_large/checkpoint_1164000
Flan-T5 XLt5_1_1_xl.gings://t5-data/pretrained_models/t5x/flan_t5_xl/checkpoint_1138000
Flan-T5 XXLt5_1_1_xxl.gings://t5-data/pretrained_models/t5x/flan_t5_xxl/checkpoint_1114000

No comments:

Post a Comment

Connect broadband

Training-validation-test split and cross-validation done right

  One crucial step in machine learning is the choice of model. A suitable model with suitable hyperparameter is the key to a good prediction...