Introduction
After running 200+ experiments with different seeding techniques, we've identified four categories of seeds with measurably different effects on LLM creativity. This guide synthesizes our findings into actionable strategies.
The Four Types of Seeds
1. Random Seeds (Low Effectiveness)
Example: "velvet quantum daffodil telescope"
Theory: Break the model's statistical patterns with unrelated concepts
Reality:
- Diversity increase: +3-7%
- Thematic influence: Negligible
- Risk: None (no quality degradation)
Our data:
- Tested 50 random 4-word passphrases
- Control vs seeded: p=0.18 (not statistically significant)
- Conclusion: Random seeds are mostly placebo
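A significance check like the p=0.18 result above can be reproduced with a simple permutation test on per-run diversity scores. This is a standalone sketch, not our analysis script, and the score lists below are made-up placeholders, not our experimental data:

```python
import random

def permutation_test(control, seeded, n_perm=10_000, seed=0):
    """Two-sided permutation test on the difference of batch means."""
    rng = random.Random(seed)
    observed = abs(sum(seeded) / len(seeded) - sum(control) / len(control))
    pooled = list(control) + list(seeded)
    k = len(control)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        # Re-split the shuffled pool and see how often a difference this
        # large arises by chance alone.
        diff = abs(sum(pooled[k:]) / (len(pooled) - k) - sum(pooled[:k]) / k)
        if diff >= observed:
            hits += 1
    return hits / n_perm

# Placeholder per-run diversity scores (NOT our data)
control = [0.41, 0.44, 0.39, 0.42, 0.40, 0.43]
seeded = [0.43, 0.45, 0.41, 0.44, 0.42, 0.44]
print(permutation_test(control, seeded))
```

A p-value well above 0.05 here, as in our runs, means the seeded batch's extra diversity is indistinguishable from noise.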
2. Thematic Seeds (High Effectiveness)
Example: "titanium platinum osmium mercury" (for spy stories)
Theory: Semantic priming shifts the model's context window
Reality:
- Diversity increase: +21-45%
- Thematic influence: STRONG (72% of stories adopted the theme)
- Risk: Moderate (may override user intent)
Our data:
- Metal names → mining corporations (72% of stories)
- Color names → art theft scenarios (68%)
- Musical terms → performance/espionage (61%)
Best practices:
- Choose themes adjacent to your domain (not directly in it)
- Use 3-5 words for strongest effect
- Avoid proper nouns (they leak into outputs)
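The best practices above are easy to automate. This is a minimal sketch of the idea, not our seed-generator.py; the word pools and function name are illustrative:

```python
import random

# Illustrative pools of domain-adjacent common nouns. Proper nouns are
# deliberately excluded because they tend to leak into outputs.
ADJACENT_POOLS = {
    "spy": ["titanium", "platinum", "osmium", "mercury", "ledger", "frequency"],
    "heist": ["crimson", "ochre", "varnish", "cobalt", "canvas", "lacquer"],
}

def thematic_seed(domain, n_words=4, rng=None):
    """Sample 3-5 adjacent-domain words into a space-separated seed."""
    if not 3 <= n_words <= 5:
        raise ValueError("use 3-5 words for the strongest effect")
    rng = rng or random.Random()
    return " ".join(rng.sample(ADJACENT_POOLS[domain], n_words))

print(thematic_seed("spy", 4, random.Random(42)))
```

The pools should sit adjacent to your target domain, per the first best practice, so the seed primes without dictating.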
3. Structural Seeds (Moderate Effectiveness)
Example: "Three-act structure: setup/confrontation/resolution"
Theory: Constrain the narrative architecture to free up content-level creativity
Reality:
- Diversity increase: +12-18%
- Structure adherence: 85%
- Risk: Low (users often want structure anyway)
Our data: Compared free-form vs structured prompts across 100 story generations:
| Metric | Free-form | Structured Seed |
|---|---|---|
| Unique plot points | 42 | 61 |
| Character diversity | 3.2/5 | 4.1/5 |
| Coherence score | 3.8/5 | 4.4/5 |
Conclusion: Structure actually increases creativity by reducing the model's decision space
4. Anti-pattern Seeds (Mixed Results)
Example: "Avoid: Jack, Ace, Raven, generic hacker names"
Theory: Explicit constraints guide the model away from absorption points
Reality:
- Diversity increase: +8-15%
- Risk: High (can create new absorption points)
Failure case: When we prompted "Don't name the character Jack," we got:
- "Not-Jack": 3%
- "Jackson": 8%
- "Jacques": 5%
- In total, the model circumvented our intent 16% of the time
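Circumvention like this slips past an exact-match check. A fuzzy check catches the near-variants; the 0.5 threshold below is a guess you would want to tune on your own outputs:

```python
from difflib import SequenceMatcher

def circumvents(name, banned, threshold=0.5):
    """Flag names that dodge a ban via variants like 'Jackson' or 'Jacques'."""
    a, b = name.lower(), banned.lower()
    if b in a or a in b:          # catches "Jackson" and "Not-Jack"
        return True
    # Similarity ratio catches spelling variants such as "Jacques"
    return SequenceMatcher(None, a, b).ratio() >= threshold

for candidate in ["Jackson", "Jacques", "Not-Jack", "Mira"]:
    print(candidate, circumvents(candidate, "Jack"))
```

A low threshold trades false positives (e.g. short names that share letters with the ban) for fewer missed variants, so score the flagged outputs by hand before trusting the rate.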
Combining Strategies: The Stack Approach
Our most successful technique combines multiple seed types:
SEED STACK for cyberpunk character generation:
1. Thematic seed: "neon ink circuits flesh"
2. Structural constraint: "Character has 3 defining contradictions"
3. Anti-pattern: "Avoid console cowboy archetype"
4. Upstream lock: "Character's profession: bioethicist"
Generate character backstory (200 words)
Results vs control:
- Unique character archetypes: +67%
- "Jack" appearances: -94%
- Coherence: -2% (negligible quality loss)
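The stack above is easy to templatize so each layer stays swappable. This sketch only assembles the prompt string; the dataclass and field names are ours, not an established API:

```python
from dataclasses import dataclass

@dataclass
class SeedStack:
    thematic: str
    structural: str
    anti_pattern: str
    upstream_lock: str
    task: str

    def to_prompt(self):
        """Render the four seed layers followed by the task."""
        return "\n".join([
            f"Thematic seed: {self.thematic}",
            f"Structural constraint: {self.structural}",
            f"Anti-pattern: {self.anti_pattern}",
            f"Upstream lock: {self.upstream_lock}",
            "",
            self.task,
        ])

stack = SeedStack(
    thematic="neon ink circuits flesh",
    structural="Character has 3 defining contradictions",
    anti_pattern="Avoid console cowboy archetype",
    upstream_lock="Character's profession: bioethicist",
    task="Generate character backstory (200 words)",
)
print(stack.to_prompt())
```

Keeping each layer as a separate field makes ablations cheap: drop one layer, rerun, and compare diversity metrics against the full stack.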
Model-Specific Findings
Different models respond differently to seeding:
Claude (best for thematic seeds):
- Strong semantic association
- Follows thematic seeds 85% of the time
- Less sensitive to random seeds
GPT-4 (best for structural seeds):
- Strong instruction-following
- Adheres to structure 92% of the time
- Moderate thematic sensitivity
Llama (most resistant to seeding):
- Weaker semantic priming
- Thematic seeds only 60% effective
- But: naturally more diverse (needs less seeding)
Practical Recommendations
For maximum diversity:
- Start with thematic seed (4-5 adjacent concepts)
- Add structural constraint (architecture, format)
- Lock 1-2 variables upstream (name, profession, setting)
- Avoid anti-patterns (they backfire)
For quality + diversity balance:
- Use structural seeds only
- Combine with few-shot examples
- Increase temperature slightly (0.8 → 0.95)
For speed (API costs):
- Thematic seed alone (single prompt)
- Higher temperature (0.95-1.0)
- Accept lower hit rate, generate 3-5 samples
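The sample-several-and-filter loop looks like this in sketch form. `call_model` is a placeholder for whatever API client you use, and the `is_hit` filter here is a deliberately cheap stand-in for a real quality check:

```python
def call_model(prompt, temperature):
    """Placeholder for a real API call; swap in your client of choice."""
    return f"[story sampled at T={temperature}]"

def is_hit(story, banned=("Jack", "Ace", "Raven")):
    """Cheap filter: reject outputs that reuse known absorption points."""
    return not any(name in story for name in banned)

def sample_until_hit(prompt, temperature=0.95, max_samples=5):
    """Draw up to max_samples at high temperature; return the first hit."""
    for _ in range(max_samples):
        story = call_model(prompt, temperature)
        if is_hit(story):
            return story
    return None

result = sample_until_hit("steel glass orbit hymn\n\nWrite a heist story.")
print(result is not None)
```

With a high temperature you pay for more samples but each one is cheap; budget max_samples against your acceptable miss rate.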
Measuring Success
Track these metrics to evaluate your seeding strategy:
- Type-Token Ratio (TTR): unique tokens / total tokens
- Entropy: bits per token (higher = more diverse)
- Semantic clustering: t-SNE visualization of outputs
- Absorption point frequency: % of outputs using top-5 most common elements
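The first two metrics are only a few lines each. This standalone sketch mirrors what a script like creativity-metrics.py would compute, but is not that script:

```python
import math
from collections import Counter

def type_token_ratio(tokens):
    """Unique tokens / total tokens; 1.0 means no repetition at all."""
    return len(set(tokens)) / len(tokens)

def token_entropy(tokens):
    """Shannon entropy in bits per token; higher = more diverse."""
    counts = Counter(tokens)
    n = len(tokens)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

tokens = "the spy met the courier near the old canal".split()
print(round(type_token_ratio(tokens), 3))  # → 0.778
print(round(token_entropy(tokens), 3))     # → 2.642
```

Both metrics are length-sensitive (TTR especially falls as texts grow), so compare outputs of similar length or compute them over fixed-size windows.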
Tools we built:
- creativity-metrics.py - Calculate TTR, entropy
- seed-generator.py - Generate thematic seeds
Future Work
We're currently investigating:
- Dynamic seeding: Different seeds for different story sections
- Adversarial seeds: Train model to resist specific patterns
- Multilingual seeds: Do Japanese/Arabic seeds affect English outputs?
- Seed persistence: How long does seed influence last in multi-turn chats?
Related scratchpads:
Datasets: