You're looking at a specific version of this model. Jump to the model overview.

ictnlp /llama-omni:36c9bcf7

Input schema

The fields you can use to run this model with an API. If you don’t give a value for a field its default value will be used.

Field Type Default value Description
input_audio
string
Input audio
prompt
string
Please directly answer the questions in the user's speech
None
temperature
number
0

Max: 1

Controls randomness. Lower values make the model more deterministic, higher values make it more random.
top_p
number
0

Max: 1

Controls diversity of the output. Valid when temperature > 0. Lower values make the output more focused, higher values make it more diverse.
max_new_tokens
integer
256

Min: 1

Maximum number of tokens to generate

Output schema

The shape of the response you’ll get when you run this model with an API.

Schema
{'properties': {'audio': {'format': 'uri', 'title': 'Audio', 'type': 'string'},
                'text': {'title': 'Text', 'type': 'string'}},
 'required': ['audio', 'text'],
 'title': 'ModelOutput',
 'type': 'object'}