Installing Ollama and getting started with local AI

Recently, I keep seeing issues with Claude token usage and cost.

Suddenly, I realized that I might have to use AI offline when I'm not online.

It's like when people used to buy mp3s, even though they could listen to them through streaming.

So I first thought about installing DeepSea, and then I looked into which model would work for me.

My computer is a MacBook (16GB), so it's not a high-end machine. I wanted something that fits my computer, can help with coding, and can hold a decent conversation in Korean.

Which model is right for my MacBook?

1. Qwen2.5-Coder (큐멘2.5 코oder)

This is Alibaba's big hit in China and currently dominates the local coding model market. Although it was trained mainly on English and Chinese, the Qwen2.5 series includes a lot of multilingual data by default, so it can understand Korean questions quite accurately and handle Korean-annotated code well.

Recommended size: qwen2.5-coder:7b (about 4.7GB)
Features: It can write, debug, and refactor code in ways that commercial models like GPT-4o can't easily match. It runs a little hot on a 16GB MacBook, but as a coding assistant it's the most reliable.

ollama run qwen2.5-coder:7b

2. EXAONE 3.5 (엑사원 3.5)

This is a Korean AI model developed by LG AI Research and released as open source. Because it was made by a major Korean company, it is very strong at understanding Korean context and Korean coding questions.

Recommended size: exaone3.5:7.8b (about 4.8GB)
Features: Korean conversation quality is excellent. Even though it is not a coding-specific model, it works well for explaining development concepts in Korean and writing basic to intermediate programming code.

ollama run exaone3.5:7.8b

Installing

1. Install from the official Ollama site

When you visit ollama.com, the installation command is shown right away.

Ollama 공식 사이트

Copy the command from the site and paste it into your terminal.

curl -fsSL https://ollama.com/install.sh | sh

Ollama 설치 진행 화면

2. Check the usage

Ollama 사용법

3. Install the models

qwen2.5-coder:7b

qwen2.5-coder 설치

ollama run qwen2.5-coder:7b

deepseek-r1:1.5b

My computer specs recommended the 1.5B model, so I installed that as well. It's lightweight at 1.1 GB.

deepseek-r1 설치

ollama run deepseek-r1:1.5b

Run screen

Ollama 실행 화면

It's definitely slower than Claude or Codex. Still, I'll give it a try.