AI Chat Settings - On-device | ArchitectBuddies

top of page

Architect Buddies™

Step 1 - Select 'On-device LLM' from the AI Selection list

Step 2 - Select a model from the Model Selection list

If you have already downloaded a model, it will appear under Model Selection

If you only see 'No Model Selected' then you will need to download a model

The app provides an option to download a suggested model, or if you have already downloaded one, load it from the file system.
You also have the ability to download a language model (e.g. from HuggingFace) in the .gguf format by providing a URL to a model
Selecting a model causes it to load into your device's memory. Large models take a long time. A green tick badge will display next to the model once it has loaded.

Step 3 - Configure your local AI Model

This can be tricky when using devices with limited memory. 16GB is the minimum recommended. Changes to settings can cause undefined behaviour

Adjust the slider on the Context Size. If suitable, an 'Apply Recommended Settings' button should appear.
Look at the provided model information for the memory pressure. Do not exceed the GPU budget as this will cause undefined behaviour.

Remember that large, more capable models require lots of memory and processing power. We recommended that you use Cloud-based services to get the best behaviour out of your assistants.

The capabilities of small language models are improving all the time. Sign up to keep up to date on the best models to try. Sign up here.

bottom of page