Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.ardor.cloud/llms.txt

Use this file to discover all available pages before exploring further.

How Context Works

When conversations grow longer, they exceed the model’s context window limit:
1

User

First message from user
2

Cerebrum

Response from Cerebrum
3

User

Another message
4

⚠️ Context window limit

Messages below exceed the limit
5

Cerebrum

❌ Can’t fit
6

User

❌ Can’t fit
To solve this, Cerebrum summarizes older messages to make room for new conversations:
1

📦 Summarized Messages

Older messages compressed into summary
2

Cerebrum

Recent response (with overlap from previous context)
3

User

✅ Fits within limit
4

Cerebrum

✅ Fits within limit

Automatic Summarization

Cerebrum uses a sliding window with overlap approach. The overlap ensures context continuity - when summarizing, Cerebrum preserves context from the previous window so important connections aren’t lost.
WindowMessagesOverlap
11, 2, 3
23, 4, 5, 6Message 3 from Window 1
36, 7, 8, 9Message 6 from Window 2
Summarization happens automatically in the background. You don’t need to do anything special.

What Gets Preserved

When Cerebrum summarizes context, it prioritizes:
PriorityInformation TypeExample
CriticalArchitecture decisions”Using PostgreSQL for the database”
CriticalRequirements from PRD”Must support 1000 concurrent users”
HighService configurations”Backend runs on port 3000”
HighCurrent task context”Working on user authentication”
MediumPrevious solutions”Fixed CORS issue by adding headers”
LowExploratory discussions”Considered Redis but chose PostgreSQL”

Best Practices

When context seems lost, ask Cerebrum to “look at the Canvas” - it shows the current environment, all services, and their configurations.
The Canvas displays the state of a specific environment. Make sure you’re looking at the right env (dev/staging/prod) when discussing changes.
For complex discussions, provide your own summary: “To recap, we decided to use X because Y.”
For very large projects, consider completing major milestones before moving on. This creates natural breakpoints.

Context Limits by Model

Different models have different context windows:
ModelContext Window
Gemini 3.1 Pro1M tokens
Gemini 3.5 Flash1M tokens
GPT-5.41.05M tokens
Grok 4.202M tokens
Claude Sonnet 4.6200K tokens
Claude Opus 4.6200K tokens
Claude Opus 4.71M tokens
Larger context windows mean less frequent summarization, but all models benefit from Cerebrum’s smart context management.