Skip to content

OllamaRequest

Defined in: adapters/ollama.ts:55

Ollama API request structure for local models.

Ollama provides a local server for running LLMs. Parameters are nested in an options object, and uses num_predict instead of max_tokens.

const request: OllamaRequest = {
model: 'llama2',
messages: [{ role: 'user', content: 'Hello!' }],
options: {
temperature: 0.7,
num_predict: 100
}
};

messages: OllamaMessage[]

Defined in: adapters/ollama.ts:60

Conversation messages


model: string

Defined in: adapters/ollama.ts:57

Local model name (e.g., ‘llama2’, ‘mistral’, ‘codellama’)


optional options: object

Defined in: adapters/ollama.ts:63

Model parameters nested in options object

optional num_predict: number

Maximum tokens to predict (Ollama uses ‘predict’ not ‘tokens’)

optional stop: string[]

Stop sequences

optional temperature: number

Sampling temperature

optional top_k: number

Top-K sampling

optional top_p: number

Nucleus sampling


optional stream: boolean

Defined in: adapters/ollama.ts:81

Enable streaming responses