Semantic DNA (N-Gram Analysis)

Search Query Pattern Mining

Why Individual Query Analysis Isn't Enough

Your Search Terms report shows 2,000 queries. You review them one by one, add negatives, and move on.

The problem: You're looking at trees and missing the forest.

Example:

You have 47 different search queries containing the word "free":

"free project management tool"

"project management free trial"

"free alternative to asana"

... 44 more

Reviewing them individually, some look okay (free trial intent). But in aggregate, the word "free" across all 47 queries costs $3,200/month with 0.3x ROAS.

N-Gram Analysis reveals this pattern in one row: free | 47 queries | $3,200 spend | 2 conversions | 0.3x ROAS

One negative keyword ("free") eliminates $3,200/month in waste across 47 queries.

That's the power of pattern-level analysis vs query-level analysis.

1. Understanding N-Gram Types

Unigrams (single words):

Most useful for finding broad waste patterns ("free", "cheap", "jobs") or broad opportunity signals ("enterprise", "premium", "urgent").

Bigrams (two-word pairs):

The sweet spot for actionable insights. "Free trial" is very different from just "free." Bigrams capture intent more precisely.

Trigrams (three-word sequences):

Highly specific. "Best enterprise CRM" or "free project management" — these often map directly to ad group themes.

The N-Gram Table:

N-GramTypeQuery CountImpressionsClicksSpendConvROASSignal
freeUni4712,400890$3,20020.3x🔴 Waste
enterpriseUni123,200280$1,400228.2x🟢 Opportunity
free trialBi82,100180$72053.4x🟡 Mixed
best crm forTri61,800150$6001210.1x🟢 Opportunity

Signal classification:

🟢 Opportunity: ROAS >2x account average, 5+ conversions

🟡 Mixed: ROAS near account average, needs investigation

🔴 Waste: ROAS <0.5x account average OR 0 conversions with >$500 spend

2. Intent Pattern Grouping

What it does: Classifies n-grams by inferred user intent based on keyword modifiers.

Intent Groups:

🛒 Transactional: Contains buy, purchase, order, pricing, cost, subscribe, signup

Expected ROAS: Highest (users are ready to convert)

🔍 Informational: Contains how, what, why, guide, tutorial, learn, difference

Expected ROAS: Lowest (users are researching, not buying)

⚖️ Comparison: Contains best, top, review, vs, compare, alternative

Expected ROAS: High (users are evaluating options, close to purchase)

📍 Local: Contains near me, in [city], local, nearby

Expected ROAS: Varies (high for services, lower for products)

Why this matters:

If 60% of your spend goes to Informational n-grams with 0.5x ROAS, you have a targeting problem. Your broad match keywords are attracting researchers, not buyers.

Action: Add Informational modifiers as negative keywords ("how to", "what is", "tutorial") to focus budget on Transactional and Comparison intent.

But be careful: Some Informational queries convert at high rates if your landing page serves the educational need and then converts. Check before excluding.

3. Systemic Waste Detection

The most powerful feature: Finding n-grams that appear across many queries and consistently waste money.

The Waste Leaderboard:

Ranks n-grams by total wasted spend (spend on queries with <1x ROAS containing this n-gram).

Example Waste Leaderboard:

RankN-GramWasted SpendQueries AffectedAction
1free$3,200/mo47 queriesAdd as negative
2jobs$2,100/mo23 queriesAdd as negative
3template$1,800/mo31 queriesAdd as negative
4salary$950/mo12 queriesAdd as negative
5reddit$720/mo8 queriesAdd as negative

Total waste from top 5 n-grams: $8,770/month

Adding these 5 words as negative keywords eliminates waste across 121 search queries simultaneously.

Compare this to manual query review: You'd need to identify and negative-match 121 individual queries. With N-Gram Analysis, it's 5 negative keywords.

One-Click Actions:

"Add as Negative (Campaign)" → Adds as campaign-level negative

"Add as Negative (Account)" → Adds as account-level negative keyword list

"Review Queries" → Shows all queries containing this n-gram for manual review before action

4. Opportunity Mining

The flip side of waste detection: Finding n-grams that consistently produce high ROAS and should be promoted.

The Opportunity Leaderboard:

RankN-GramRevenueROASQueriesAction
1enterprise$11,200/mo8.2x12 queriesCreate ad group
2integration$8,400/mo6.1x9 queriesCreate ad group
3automated$5,600/mo5.8x7 queriesIncrease bids

Actions for high-performing n-grams:

Create Dedicated Ad Group: If "enterprise" drives 8.2x ROAS across 12 queries, create an ad group specifically for enterprise-related keywords with tailored ad copy and landing pages.

Increase Bid Modifiers: Boost bids for keywords containing high-performing n-grams.

Expand Keywords: Use the n-gram as seed for keyword research. If "enterprise" works, try "enterprise features", "enterprise pricing", "enterprise demo."

Inform Creative: If "automated" is your top-performing n-gram, make sure "automated" appears in your headlines and descriptions.

Cross-reference with Creative Lab: If an n-gram drives high ROAS but your ad copy doesn't mention it, you're missing an alignment opportunity.

5. N-Gram Trend Analysis

What it shows: How n-gram performance changes over time.

Visual: Sparkline charts showing ROAS and spend trends for each n-gram over 90 days.

Why trends matter:

A waste n-gram today might have been profitable 3 months ago. A profitable n-gram might be declining.

Trend patterns:

📈 Rising Star: Low spend initially, ROAS climbing. Worth expanding into a dedicated ad group.

📉 Fading Giant: Was profitable, now declining. Audience saturation or competitive pressure. Consider reducing exposure.

🔄 Seasonal: Performance peaks and dips cyclically. Adjust bids seasonally rather than permanently excluding.

📊 Stable Performer: Consistent ROAS over 90 days. Your foundation keywords—protect and maintain.

Monthly review: Sort by "ROAS Change (30d)" to quickly identify n-grams with the biggest positive or negative shifts.

When to Use This Dashboard (vs. Other Tools)

Use N-Gram Analysis when you want to:

Find systemic search term waste patterns

Discover high-performing keyword themes

Build negative keyword lists efficiently

Identify intent mismatches in your traffic

Don't use N-Gram Analysis when you want to:

Review individual search terms (use Search Hygiene)

Optimize ad creative (use Creative Lab)

Reallocate budgets (use Budget Balancer)

Who Should Use This:

✅ PPC specialists (advanced keyword optimization)

✅ SEO teams (search intent insights transfer to organic)

✅ Content strategists (understanding what language your audience uses)

❌ Executives (too granular)

❌ Beginners (start with Search Hygiene for query-level review)

How Often: Monthly. Weekly only if you just launched broad match campaigns and need rapid waste containment.

Technical FAQ

Q: How is this different from the Search Hygiene Cockpit?

Search Hygiene works at the individual query level—you review one query at a time. N-Gram Analysis aggregates across queries to find patterns. Think of it as: Search Hygiene finds individual weeds, N-Gram Analysis identifies the root system.

Q: What's the minimum data needed for reliable n-gram analysis?

At least 30 days of search query data with 500+ unique queries. For trigram analysis, you need 1,000+ queries. The system warns if sample sizes are too small for statistical reliability.

Q: Should I always add waste n-grams as negative keywords?

Almost always, but check first. Click 'Review Queries' to see all queries containing the n-gram. Sometimes a waste unigram ('free') contains a profitable bigram ('free trial'). In that case, add 'free' as a broad match negative but add 'free trial' as a negative keyword exception.

Q: Can n-gram analysis help with ad copy?

Absolutely. High-performing n-grams reveal the language your customers use. If 'automated workflow' has 8x ROAS, make sure those exact words appear in your headlines and descriptions. This improves ad relevance and Quality Score.

Q: How does intent grouping work for ambiguous queries?

Intent is inferred from keyword modifiers, not query meaning. A query with both 'how to' and 'buy' gets classified by the dominant modifier. You can manually reclassify any n-gram in the interface if the automatic classification is wrong.

Q: Can I export the waste and opportunity lists?

Yes. Click 'Export' on any leaderboard to download as CSV. The waste list can be directly uploaded to Google Ads as a negative keyword list. The opportunity list makes an excellent brief for keyword research and ad group creation.

Q: Does this work for Shopping campaigns?

Yes, with limitations. Shopping search terms tend to be more product-specific, so n-gram patterns are less dramatic than in Search. But waste patterns like brand names you don't carry or irrelevant modifiers ('used', 'refurbished') are still detectable.

Q: How often should I refresh my negative keyword list from n-gram data?

Monthly. New waste patterns emerge as Google Ads broad match evolves and new search behaviors appear. A one-time negative keyword cleanup decays within 2-3 months as new waste patterns replace old ones.

Semantic DNA (N-Gram Analysis) | ClickCatalyst