Skip to content

lamini.Lamini.upload_data

Upload data for training. This method gets called inside lamini.Lamini.train. You can use this method directly to get dataset id's and upload data for training without kicking off a training job.

Specify the data as an argument to lamini.Lamini.upload_data

data = [
    {"input": "What's your favorite animal?", "output": "dog"},
    {"input": "What's your favorite color?", "output": "blue"},
    ...
]
llm = Lamini(model_name="meta-llama/Meta-Llama-3-8B-Instruct")
llm.upload_data(data)

This will print out a dataset_id which can be used to persist the data across multiple runs.

Your dataset id is: 85e27a2e28bbfd526d0f7bb902b2a987b1f7632b556d77c02ca06dad76e57fcc . Consider using this in the future to train using the same data.
Eg: llm.train(dataset_id='85e27a2e28bbfd526d0f7bb902b2a987b1f7632b556d77c02ca06dad76e57fcc')

Uploading data....
Upload to blob completed for data.
Data pairs uploaded to blob.

Persisted data will be evicted after a month of storage, and we recommend that you manage your data separately.