"Building a Claude Harness (2/3) — How to Separate CLAUDE.md, Skills, Hooks, and Permissions"

5월 18, 2026

The most common mistake in Claude-based harness design is stuffing every rule into one CLAUDE.md file. But in real operation, the roles are different. CLAUDE.md is identity and baseline guidance, skills are repeatable procedures, hooks are automatic checkpoints, and permissions are access boundaries. Once those layers are separated, instructions stay shorter, automation becomes more reliable, and safety stops depending on memory alone.

Key Takeaways

CLAUDE.md is the always-loaded baseline instruction layer for project identity, hard rules, and read-first paths.
Skills are procedure packages for repeatable work. Long step-by-step workflows belong here, not in always-loaded memory.
Hooks are the automatic enforcement layer. Anything that should be checked or blocked at execution time belongs closer to hooks than to prose.
Permissions and settings are the access-boundary layer for reading, editing, executing, and external access.
So the real Claude harness problem is not writing one good instruction file. It is deciding which responsibility belongs to which layer.

1. Why separation comes first

If you look across the Claude Code documentation, memory files, settings, hooks, subagents, and slash commands are exposed as distinct surfaces. That matters for one reason above all:

A Claude harness is not meant to be run from one giant instruction file.

When teams ignore that, the same failure pattern appears repeatedly:

CLAUDE.md gets too long
important rules are buried in noise
repeatable procedures get re-explained in prompts
enforceable safety rules remain mere suggestions
permissions are handled as prose instead of configuration

That is why the core question of this post is not feature discovery. It is role separation.

2. CLAUDE.md is the first signboard, not the whole operating system

As the earlier CLAUDE.md authoring guide emphasized, CLAUDE.md is always part of context. That means it should not be treated as a dumping ground for everything the agent might ever need to know.

Good content for CLAUDE.md usually includes:

what this project is
what should be read first
what must never happen
the default workflow for small versus multi-step work
the responsibility of major directories

And some things usually belong elsewhere:

long step-by-step procedures
fast-changing operational tips
task-specific detail manuals
rules that should be mechanically enforced

In one line:

CLAUDE.md is best used as an identity file and a hard-rule signboard, not as a complete operations manual.

3. Skills separate long procedures from permanent context cost

In our own operating language, skills are reusable task modules. In the broader Claude ecosystem, related goals may also be achieved with custom commands, subagents, or reusable prompt packages. The harness principle is the same:

is the task repeated often
does it require multiple stages
does it have a stable input, validation, and output pattern

If so, it should probably move out of CLAUDE.md and into a reusable procedure layer.

Typical examples include:

pre-publish review
handoff writing
code review
test execution and summarization
directory-specific edit procedures

This creates several benefits:

CLAUDE.md stays short
long procedures are loaded only when relevant
teams can share the same operational flow
output formats become easier to standardize

So the real value of skills is not "more power." It is keeping repeatable process out of always-on context.

4. Hooks are automatic checkpoints, not just stronger instructions

In Anthropic's official hooks documentation, hooks attach to events such as PreToolUse, PostToolUse, UserPromptSubmit, Stop, and SubagentStop, and are configured through matchers and commands in settings files.

In harness terms, that translates roughly like this:

CLAUDE.md: the direction you want the agent to follow
skill: the process you want it to use
hook: the automatic device that checks or intervenes at a specific execution boundary

Typical hook-shaped rules include:

run formatters or checks after certain file edits
block or warn on sensitive path modifications
validate prompts before they are processed
verify that handoff artifacts exist before a session stops

If these stay only in prose, they depend on memory and consistency. Hooks move them closer to execution-time enforcement.

That makes hooks fundamentally different from simply writing stronger instructions.

5. Permissions carry the safety and access boundary

The official Claude Code settings documentation exposes permissions.allow, ask, deny, defaultMode, disableBypassPermissionsMode, and additionalDirectories. This matters because it lets you separate safety from prompting and express it as configuration.

The key operational principle is simple:

read
edit
execute
web access
access to sensitive files

These should not be treated as equivalent actions.

For example:

.env, secrets/**, and credential files belong in deny
destructive shell commands often belong in ask or deny
standard team commands can be narrowly allowed
personal experiments fit better in .claude/settings.local.json
shared rules belong in .claude/settings.json

This is how you separate "the AI can do this" from "the AI is allowed to do this."

6. Each layer answers a different kind of question

When in doubt, classify by question rather than by feature name.

Question	Better matching layer
What is this project, and what is the baseline posture	`CLAUDE.md`
How should this kind of task be executed step by step	skill
Should an automatic check run before or after this action	hook
Should this action be allowed, confirmed, or blocked	permissions/settings

That table alone prevents a large amount of CLAUDE.md bloat.

7. Common anti-patterns

Weak Claude harnesses tend to show the same mistakes.

7.1 Putting every rule into CLAUDE.md

The file grows, adherence drops, and the important hard rules disappear into background noise.

7.2 Re-explaining repeatable work in the prompt every time

This wastes tokens and causes process drift.

7.3 Writing blocking rules only as prose

For example, saying "do not touch credentials" without reinforcing it through permissions or hooks.

7.4 Using skills or prose where hooks are a better fit

Validation automation, path blocking, and stop-time checks are often execution-boundary problems.

7.5 Mixing personal exceptions into shared config

Personal experiments belong in local settings, not in shared team policy.

8. A practical separation rule: short rules, long procedures, automatic enforcement, narrow permissions

A reliable operating baseline can be summarized in four lines:

short rules go in CLAUDE.md
long procedures go in skills
automatic enforcement goes in hooks
narrow permissions go in settings

This works because each layer fails differently.

CLAUDE.md fails when it becomes bloated
skills fail when procedures drift and go stale
hooks fail when they over-block workflow
permissions fail when they are either too loose for safety or too tight for usability

So the goal is not to maximize one layer. It is to keep each layer thin and sharp for its role.

9. What this looks like in a repo like ours

In a content-and-scripts workspace, the split often looks like this:

CLAUDE.md: publish restrictions, config/ restrictions, read-first docs, workflow gates
skills: blog drafting, pre-publish review, handoff writing, code review
hooks: path validation after context-doc edits, checks after script changes
permissions/settings: deny secrets, ask/deny risky commands, allow narrow shared commands

That arrangement clarifies not only what the agent should do, but also what it is allowed to do.

10. Good layer separation matters more than one perfect prompt

When people first adopt Claude Code, they often focus on writing better instructions. That matters, but at operating scale, structure matters more.

baseline identity and hard rules belong in CLAUDE.md
reusable processes belong in skills
execution-time intervention belongs in hooks
access boundaries and safety policy belong in permissions and settings

Once that separation is in place, CLAUDE.md gets shorter, repeatable work becomes steadier, safety rules are forgotten less often, and collaboration gets easier.

That is why the most useful Claude harness question is not "what should I tell the model?" but this:

Should this rule live in memory, in a reusable procedure, in an automatic checkpoint, or in the permission boundary?

Part 3 compares this operational philosophy with OpenAI's and shows why the two ecosystems naturally push teams toward different harness shapes.

References

Anthropic Docs, Claude Code settings
https://docs.anthropic.com/en/docs/claude-code/settings
Anthropic Docs, Hooks reference
https://docs.anthropic.com/en/docs/claude-code/hooks
Anthropic Docs, Subagents
https://docs.anthropic.com/en/docs/claude-code/sub-agents
Anthropic Docs, Slash commands
https://docs.anthropic.com/en/docs/claude-code/slash-commands
Anthropic Docs, Security
https://docs.anthropic.com/en/docs/claude-code/security
drafts/blog/260425_CLAUDEmd작성가이드_블로그.md
drafts/blog/260425_claude_md_authoring_guide_en.md
docs/blog_series_하네스엔지니어링_총괄_design.md
sources/260518_하네스엔지니어링_15장_블로그활용노트.md

This is Part 2/3 of the OpenAI and Claude Harnesses series. Suggested next reading: the OpenAI vs Claude harness comparison.

Series overview: Harness Engineering Series Guide

이 블로그 검색

MaJu Tech Notes