Digital Fingerprint of a Trader: How to Identify a Market Maker by Their Order Book Behavior

Digital fingerprint of a trader Behavioral fingerprint of a trading algorithm: unique patterns of timing, sizing, and placement

Every algorithm leaves a unique fingerprint. Learn to read it — and you will know who is on the other side of your trade.

Introduction: The Order Book as a Crime Scene

When a forensic investigator arrives at a crime scene, they look for fingerprints, shoe prints, DNA — anything that links an event to a specific individual. The order book is a place where every second, hundreds of participants leave their "fingerprints": orders of a specific size, at a specific frequency, at a specific distance from the price, with a specific time-to-live.

On centralized exchanges (CEX), these fingerprints are anonymous — you only see aggregated volume at each price level. But even from aggregated data, you can learn a lot. And on DEX exchanges with an on-chain order book — such as Hyperliquid — each order is tied to a specific wallet address, and the "fingerprints" become personal.

In this article, we will explore how to build a system of "behavioral fingerprints" for identifying participant types and specific market maker algorithms.

Part 1: What Is a Behavioral Fingerprint

Five dimensions of a digital fingerprint Radar chart: five key dimensions of a trader's behavioral fingerprint

Every Algorithm Is a Set of Rules

A market-making bot is a program that makes decisions based on rigid (or learned) rules:

How far from mid-price should orders be placed? Every MM has its own "spread profile."
What size? Round lots (100, 500, 1000)? Random numbers? Fixed size with ±5% noise?
How often should quotes be updated? Every 50ms? Only when mid-price changes? On a timer?
How to react to fills? Instant re-quoting? Pause? Shifting the remaining side?
How to react to volatility? Spread widening? Quote pulling? Size reduction?
How to manage inventory? Quote skewing when accumulating a position?

Each set of answers to these questions is a unique "signature" of the algorithm.

Five Dimensions of the Fingerprint

┌─────────────────────────────────────────────────────────────────┐
│              DIGITAL FINGERPRINT OF A TRADER                    │
│                                                                 │
│  1. TIMING       │  Inter-order intervals, reaction             │
│                  │  to events, circadian patterns               │
│──────────────────┼─────────────────────────────────────────────  │
│  2. SIZING       │  Order size distribution,                    │
│                  │  round-lot ratio, dispersion                  │
│──────────────────┼─────────────────────────────────────────────  │
│  3. PLACEMENT    │  Distance from mid-price,                    │
│                  │  bid/ask symmetry, level anchoring            │
│──────────────────┼─────────────────────────────────────────────  │
│  4. REACTION     │  Response to fills, cancels, price jumps,    │
│                  │  volatility changes                           │
│──────────────────┼─────────────────────────────────────────────  │
│  5. LIFECYCLE    │  Average order lifetime,                     │
│                  │  cancel conditions, modify vs cancel+new      │
│──────────────────┴─────────────────────────────────────────────  │
└─────────────────────────────────────────────────────────────────┘

Part 2: Feature Extraction

Hawkes process and timing patterns Self-exciting Hawkes process: event clusters with exponential decay

Timing: When and How Often

The timing pattern is the hardest element of the fingerprint to fake. It is determined by the algorithm's architecture, network latency, hardware, and even the developers' time zone.

Key metrics:

Inter-order interval (IOI) — the time between consecutive orders from the same participant. For an HFT bot, IOI = 50–500 microseconds. For a manual trader — seconds to minutes.
Reaction time to trade — the delay between an order fill and the next action. Reflects the bot's internal architecture.
Circadian pattern — the daily activity profile. An institutional MM operates during trading sessions. A crypto bot runs 24/7.
Self-excitation (Hawkes intensity) — how strongly a current order "triggers" the next one. Market makers exhibit strong self-excitation.

For modeling timing patterns, Hawkes processes — self-exciting point processes — are ideal. The Hawkes process parameters (base intensity μ, excitation coefficient α, decay rate β) form a compact "chronological fingerprint" of a participant.

Sizing: How Much and How

Size distribution — the distribution of order sizes. Professional MMs often use round lots with noise: 100 ± 5, 500 ± 10.
Size-depth correlation — does the order size depend on the current depth of the book?
Bid-ask size asymmetry — the ratio of sizes on bid and ask. The skew pattern is the "handwriting" of inventory management.
Size autocorrelation — repeatability of sizes over time.

Placement: Where in the Book

Spread offset — distance from mid-price in ticks or bps
Level preference — anchoring to specific levels (round numbers?)
Quoting symmetry — how symmetric are the bid and ask orders
Multi-level footprint — how many levels are quoted simultaneously

Reaction: How It Responds to Events

Post-fill behavior — what happens after an order is filled
Volatility response — how behavior changes when volatility increases
Quote-to-trade ratio (QTR) — the ratio of order updates to fills
Adverse selection response — reaction to price movement against the position

Lifecycle: The Order's Life Cycle

Order lifetime distribution — average order time-to-live
Modify vs Cancel+New — does it update via modify or cancel+new?
Cancel clustering — does it cancel one by one or in batches?

Part 3: Participant Classification

Market participant classification Taxonomy of market participants: market makers, HFT, opportunistic, fundamental, and noise traders

Taxonomy of Market Participants

Based on behavioral features, participants can be classified into stable categories. Research by the CFTC (Kirilenko et al., 2011):

1. Market Makers

Two-sided quotes (bid + ask) for >80% of the time
Zero or near-zero net position by end of day
High QTR (>100:1)
React to volatility by widening spreads

2. High-Frequency Traders (HFT)

Ultra-high message rate (>1000 events/minute)
Very short holding period (seconds)
Sub-millisecond reaction times

3. Opportunistic Traders

Moderate frequency
React to specific market conditions
Irregular timing, clustered around events

4. Fundamental Traders

Accumulate long-term positions
Low frequency, large order sizes
TWAP/VWAP algorithms for execution

5. Noise Traders (Retail)

Small sizes, irregular timing
Reactive: trade after price moves, not before
Market orders at peak volatility

Classification Methods

Supervised Learning: For DEX data (Hyperliquid), a training set can be assembled from addresses of known market makers. RNN models achieve accuracy >85%.

Unsupervised Learning:

Spectral Clustering (Cont et al., 2023) — clustering by order flow pattern similarity matrix
FIDR-SCAN (2024) — feature interpolation + dimension reduction + density-based clustering
Inverse Reinforcement Learning (CFTC, 2014) — recovering each trader's reward function

Part 4: Identifying Specific Market Makers

From Classification to Identification

Classification answers the question "Is this a market maker?" Identification answers the question "Is this that specific market maker?"

Building the Fingerprint Vector

From the five dimensions, we extract a numerical vector — the participant's embedding:

Fingerprint Vector (example):
┌──────────────────────────────────────────────────────────┐
│  Timing:                                                 │
│    hawkes_mu:           0.3       (base intensity)       │
│    hawkes_alpha:        0.7       (self-excitation)      │
│    hawkes_beta:         1.2       (decay rate)           │
│    median_IOI_ms:       240       (ms between orders)    │
│    circadian_peak_utc:  14.5      (peak activity)        │
│                                                          │
│  Sizing:                                                 │
│    median_size:         500                              │
│    size_cv:             0.08      (coeff. of variation)  │
│    round_lot_ratio:     0.92      (round lot share)      │
│    bid_ask_size_ratio:  0.97                             │
│                                                          │
│  Placement:                                              │
│    median_offset_bps:   3.2       (from mid-price)       │
│    quoting_symmetry:    0.94      (0=asymmetric,1=symm)  │
│    num_levels:          5         (quoted levels)         │
│                                                          │
│  Reaction:                                               │
│    post_fill_delay_ms:  12                               │
│    vol_spread_elasticity: 2.1     (spread/sigma)         │
│    qtr:                 850       (quote-to-trade ratio)  │
│                                                          │
│  Lifecycle:                                              │
│    median_lifetime_ms:  1200                             │
│    modify_ratio:        0.85      (modify vs cancel+new) │
│    batch_cancel_rate:   0.60      (batch cancel share)   │
└──────────────────────────────────────────────────────────┘

Address Clustering: "One Operator — N Wallets"

Wallet address clustering Address cluster graph: one operator — multiple wallets, grouped by behavioral similarity

On Hyperliquid, a single market maker can operate through dozens or hundreds of addresses.

Clustering algorithm:

For each active address — build a fingerprint vector over a window of N hours
Hierarchical clustering — merge addresses with distance < threshold
Temporal validation — verify cluster stability over time
Cross-pair validation — if two addresses trade different pairs but fingerprints match — strong signal

Cluster #7 (suspected: Wintermute)
├── 0x3a1f...2e8c  — BTC/USDT, 45% of activity
├── 0x7b2d...9f1a  — ETH/USDT, 30% of activity
├── 0xc4e8...5d3b  — SOL/USDT, 15% of activity
└── 0x91fa...0c7e  — ARB/USDT, 10% of activity

Common pattern: symmetric quoting, 5 levels, median_IOI=240ms,
median_size=500±8%, batch cancel 60%, presence 95%

Part 5: Manipulation Scenarios and Their Fingerprints

Spoofing: The Fingerprint of a Fake Wall

Spoofer fingerprint:
  cancel_rate:         > 95%
  lifetime:            < 2 seconds
  placement:           1-3 ticks from mid-price
  size:                anomalously large (>10x median depth)
  reaction_to_approach: cancel when price approaches
  cyclicity:           repeats >3 times / minute

Squeeze: The Liquidity Trap

Four phases: quiet accumulation, liquidity removal, stop-order cascade, profit taking. Real-time detection is possible by identifying the transition from Phase 1 to Phase 2.

Iceberg / Hidden Accumulation: The Silent Build-Up

Hidden accumulation fingerprint:
  visible_size:        small (10-50 lots)
  refill_speed:        instant (< 100ms after fill)
  refill_count:        > 20 at a single level per session
  price_reaction:      price does not move despite volume

Wash Trading: Self-Dealing

Two or more addresses from the same cluster simultaneously sit on bid and ask and execute against each other. Purpose: inflating volume.

Part 6: Implementation in Marketmaker.cc

Behavioral Fingerprint System Architecture

┌─────────────────────────────────────────────────────────────┐
│  DATA LAYER                                                 │
│  ├── Hyperliquid Node  → L3 order-by-order + wallet IDs    │
│  ├── CEX WebSocket     → L2 depth + trades                 │
│  └── Historical Store  → QuestDB / Parquet                 │
├─────────────────────────────────────────────────────────────┤
│  FEATURE EXTRACTION                                         │
│  ├── Timing Engine     → Hawkes fit, IOI distribution       │
│  ├── Sizing Engine     → Size stats, round-lot detection    │
│  ├── Placement Engine  → Offset calc, symmetry analysis     │
│  ├── Reaction Engine   → Post-fill tracker, vol response    │
│  └── Lifecycle Engine  → Lifetime stats, cancel patterns    │
├─────────────────────────────────────────────────────────────┤
│  CLASSIFICATION / IDENTIFICATION                            │
│  ├── Online Classifier → Real-time participant tagging      │
│  ├── Cluster Engine    → Address clustering (DEX)           │
│  ├── Similarity Search → Fingerprint matching               │
│  └── Anomaly Detector  → Spoof / squeeze / wash detection   │
├─────────────────────────────────────────────────────────────┤
│  VISUALIZATION                                              │
│  ├── Queue Position + Participant Labels                    │
│  ├── Cluster Graph (known MM clusters)                      │
│  ├── Alert System (manipulation detected)                   │
│  └── Historical Fingerprint Browser                         │
└─────────────────────────────────────────────────────────────┘

What the Trader Sees in the Terminal

╔═══════════════════════════════════════════════════════════════════════════╗
║  10000 USDT  │  Total: 3,200                                           ║
║──────────────┼───────────────────────────────────────────────────────── ║
║  Breakdown:  │  🤖 MM (cluster#7, ~Wintermute): 800 lots [5 levels]   ║
║              │  🤖 MM (cluster#12, unknown):     400 lots [3 levels]   ║
║              │  ⚠️ Suspicious (spoof score 87):  500 lots [lifetime<2s]║
║              │  🟡 @pro_scalper:                 100 lots              ║
║              │  🔴 MINE:                         10 lots               ║
║              │  👤 Retail / unclassified:         1,390 lots           ║
║──────────────┼───────────────────────────────────────────────────────── ║
║  Queue ahead │  Real: ~1,200 (excl. spoof)       Nominal: 1,800       ║
║  "Clean" ETA │  6.7s (vs nominal 10s)                                  ║
║  Wall type:  │  MM-backed (65% MM volume) — likely to hold             ║
╚═══════════════════════════════════════════════════════════════════════════╝

Alerts

🔴 Spoof detected at your price level — a suspicious block has been identified in front of your order
🟡 MM withdrawal — a market maker has pulled quotes, liquidity has dropped
🟡 Squeeze setup detected — a cluster of addresses is accumulating a position and removing liquidity
🟢 Wall reinforced — volume from a known MM has been added at your level

Part 7: Ethics and Limitations

What Is and Isn't Acceptable

Acceptable and encouraged:

Classifying anonymous participants by type for your own trading decisions
Detecting manipulation to protect against adverse selection
Clustering addresses on DEX to understand market structure

Not acceptable:

Deanonymizing individuals by wallet addresses
Selling identified patterns without consent
Using data for market manipulation

Limitations

Adaptive algorithms — advanced MMs add randomization
Regime switching — a single bot can change behavior depending on the market regime
False positives — two participants may coincidentally share similar parameters
CEX opacity — L3 data is unavailable on CEX

Conclusion: From Reading the Book to Reading the Participants

A traditional trader sees: 2,400 lots at the 10000 level. An advanced trader sees: "my order is 1,800th in the queue, ETA — 15 seconds." But a trader with behavioral fingerprinting sees:

"800 of those 2,400 are a market maker (likely Wintermute), the wall is solid. 500 are suspected spoofs, the real queue ahead of me is 1,300, not 1,800. Adjusted ETA — 10 seconds. The market maker isn't pulling quotes — so no large move is expected for now."

Each new layer of information is an edge. And unlike speed, the quality of order book interpretation is a domain where a retail trader can compete.

At Marketmaker.cc, we are building this system — from queue position to behavioral fingerprinting — as a unified product.

Previous article in the series: "Queue Inside the Wall: Analyzing Order Position in Order Book Density"

References and Further Reading

Kirilenko A., Kyle A., Samadi M., Tuzun T. — "The Flash Crash: High-Frequency Trading in an Electronic Market" (CFTC, 2011)
Paddrik M., Hayes R., Scherer W., Beling P. — "Gaussian Process-Based Algorithmic Trading Strategy Identification" (CFTC / OFR, 2014)
Cont R. et al. — "Unsupervised spectral clustering of trader order flow" (2023)
FIDR-SCAN — "Explainable Machine Learning for HFT Dynamics Discovery" (2024)
Do B.L., Putniņš T.J. — "Detecting Layering and Spoofing in Markets" (SSRN, 2023)
Hawkes A.G. — "Spectra of Some Self-Exciting and Mutually Exciting Point Processes" (Biometrika, 1971)
Avellaneda M., Stoikov S. — "High-Frequency Trading in a Limit Order Book" (Quantitative Finance, 2008)

Digital fingerprint of a trader Behavioral fingerprint of a trading algorithm: unique patterns of timing, sizing, and placement

Every algorithm leaves a unique fingerprint. Learn to read it — and you will know who is on the other side of your trade.

Introduction: The Order Book as a Crime Scene

In this article, we will explore how to build a system of "behavioral fingerprints" for identifying participant types and specific market maker algorithms.

Part 1: What Is a Behavioral Fingerprint

Five dimensions of a digital fingerprint Radar chart: five key dimensions of a trader's behavioral fingerprint

Every Algorithm Is a Set of Rules

A market-making bot is a program that makes decisions based on rigid (or learned) rules:

How far from mid-price should orders be placed? Every MM has its own "spread profile."
What size? Round lots (100, 500, 1000)? Random numbers? Fixed size with ±5% noise?
How often should quotes be updated? Every 50ms? Only when mid-price changes? On a timer?
How to react to fills? Instant re-quoting? Pause? Shifting the remaining side?
How to react to volatility? Spread widening? Quote pulling? Size reduction?
How to manage inventory? Quote skewing when accumulating a position?

Each set of answers to these questions is a unique "signature" of the algorithm.

Five Dimensions of the Fingerprint

┌─────────────────────────────────────────────────────────────────┐
│              DIGITAL FINGERPRINT OF A TRADER                    │
│                                                                 │
│  1. TIMING       │  Inter-order intervals, reaction             │
│                  │  to events, circadian patterns               │
│──────────────────┼─────────────────────────────────────────────  │
│  2. SIZING       │  Order size distribution,                    │
│                  │  round-lot ratio, dispersion                  │
│──────────────────┼─────────────────────────────────────────────  │
│  3. PLACEMENT    │  Distance from mid-price,                    │
│                  │  bid/ask symmetry, level anchoring            │
│──────────────────┼─────────────────────────────────────────────  │
│  4. REACTION     │  Response to fills, cancels, price jumps,    │
│                  │  volatility changes                           │
│──────────────────┼─────────────────────────────────────────────  │
│  5. LIFECYCLE    │  Average order lifetime,                     │
│                  │  cancel conditions, modify vs cancel+new      │
│──────────────────┴─────────────────────────────────────────────  │
└─────────────────────────────────────────────────────────────────┘

Part 2: Feature Extraction

Hawkes process and timing patterns Self-exciting Hawkes process: event clusters with exponential decay

Timing: When and How Often

The timing pattern is the hardest element of the fingerprint to fake. It is determined by the algorithm's architecture, network latency, hardware, and even the developers' time zone.

Key metrics:

Inter-order interval (IOI) — the time between consecutive orders from the same participant. For an HFT bot, IOI = 50–500 microseconds. For a manual trader — seconds to minutes.
Reaction time to trade — the delay between an order fill and the next action. Reflects the bot's internal architecture.
Circadian pattern — the daily activity profile. An institutional MM operates during trading sessions. A crypto bot runs 24/7.
Self-excitation (Hawkes intensity) — how strongly a current order "triggers" the next one. Market makers exhibit strong self-excitation.

Sizing: How Much and How

Size distribution — the distribution of order sizes. Professional MMs often use round lots with noise: 100 ± 5, 500 ± 10.
Size-depth correlation — does the order size depend on the current depth of the book?
Bid-ask size asymmetry — the ratio of sizes on bid and ask. The skew pattern is the "handwriting" of inventory management.
Size autocorrelation — repeatability of sizes over time.

Placement: Where in the Book

Spread offset — distance from mid-price in ticks or bps
Level preference — anchoring to specific levels (round numbers?)
Quoting symmetry — how symmetric are the bid and ask orders
Multi-level footprint — how many levels are quoted simultaneously

Reaction: How It Responds to Events

Post-fill behavior — what happens after an order is filled
Volatility response — how behavior changes when volatility increases
Quote-to-trade ratio (QTR) — the ratio of order updates to fills
Adverse selection response — reaction to price movement against the position

Lifecycle: The Order's Life Cycle

Order lifetime distribution — average order time-to-live
Modify vs Cancel+New — does it update via modify or cancel+new?
Cancel clustering — does it cancel one by one or in batches?

Part 3: Participant Classification

Market participant classification Taxonomy of market participants: market makers, HFT, opportunistic, fundamental, and noise traders

Taxonomy of Market Participants

Based on behavioral features, participants can be classified into stable categories. Research by the CFTC (Kirilenko et al., 2011):

1. Market Makers

Two-sided quotes (bid + ask) for >80% of the time
Zero or near-zero net position by end of day
High QTR (>100:1)
React to volatility by widening spreads

2. High-Frequency Traders (HFT)

Ultra-high message rate (>1000 events/minute)
Very short holding period (seconds)
Sub-millisecond reaction times

3. Opportunistic Traders

Moderate frequency
React to specific market conditions
Irregular timing, clustered around events

4. Fundamental Traders

Accumulate long-term positions
Low frequency, large order sizes
TWAP/VWAP algorithms for execution

5. Noise Traders (Retail)

Small sizes, irregular timing
Reactive: trade after price moves, not before
Market orders at peak volatility

Classification Methods

Supervised Learning: For DEX data (Hyperliquid), a training set can be assembled from addresses of known market makers. RNN models achieve accuracy >85%.

Unsupervised Learning:

Spectral Clustering (Cont et al., 2023) — clustering by order flow pattern similarity matrix
FIDR-SCAN (2024) — feature interpolation + dimension reduction + density-based clustering
Inverse Reinforcement Learning (CFTC, 2014) — recovering each trader's reward function

Part 4: Identifying Specific Market Makers

From Classification to Identification

Classification answers the question "Is this a market maker?" Identification answers the question "Is this that specific market maker?"

Building the Fingerprint Vector

From the five dimensions, we extract a numerical vector — the participant's embedding:

Fingerprint Vector (example):
┌──────────────────────────────────────────────────────────┐
│  Timing:                                                 │
│    hawkes_mu:           0.3       (base intensity)       │
│    hawkes_alpha:        0.7       (self-excitation)      │
│    hawkes_beta:         1.2       (decay rate)           │
│    median_IOI_ms:       240       (ms between orders)    │
│    circadian_peak_utc:  14.5      (peak activity)        │
│                                                          │
│  Sizing:                                                 │
│    median_size:         500                              │
│    size_cv:             0.08      (coeff. of variation)  │
│    round_lot_ratio:     0.92      (round lot share)      │
│    bid_ask_size_ratio:  0.97                             │
│                                                          │
│  Placement:                                              │
│    median_offset_bps:   3.2       (from mid-price)       │
│    quoting_symmetry:    0.94      (0=asymmetric,1=symm)  │
│    num_levels:          5         (quoted levels)         │
│                                                          │
│  Reaction:                                               │
│    post_fill_delay_ms:  12                               │
│    vol_spread_elasticity: 2.1     (spread/sigma)         │
│    qtr:                 850       (quote-to-trade ratio)  │
│                                                          │
│  Lifecycle:                                              │
│    median_lifetime_ms:  1200                             │
│    modify_ratio:        0.85      (modify vs cancel+new) │
│    batch_cancel_rate:   0.60      (batch cancel share)   │
└──────────────────────────────────────────────────────────┘

Address Clustering: "One Operator — N Wallets"

Wallet address clustering Address cluster graph: one operator — multiple wallets, grouped by behavioral similarity

On Hyperliquid, a single market maker can operate through dozens or hundreds of addresses.

Clustering algorithm:

For each active address — build a fingerprint vector over a window of N hours
Hierarchical clustering — merge addresses with distance < threshold
Temporal validation — verify cluster stability over time
Cross-pair validation — if two addresses trade different pairs but fingerprints match — strong signal

Cluster #7 (suspected: Wintermute)
├── 0x3a1f...2e8c  — BTC/USDT, 45% of activity
├── 0x7b2d...9f1a  — ETH/USDT, 30% of activity
├── 0xc4e8...5d3b  — SOL/USDT, 15% of activity
└── 0x91fa...0c7e  — ARB/USDT, 10% of activity

Common pattern: symmetric quoting, 5 levels, median_IOI=240ms,
median_size=500±8%, batch cancel 60%, presence 95%

Part 5: Manipulation Scenarios and Their Fingerprints

Spoofing: The Fingerprint of a Fake Wall

Spoofer fingerprint:
  cancel_rate:         > 95%
  lifetime:            < 2 seconds
  placement:           1-3 ticks from mid-price
  size:                anomalously large (>10x median depth)
  reaction_to_approach: cancel when price approaches
  cyclicity:           repeats >3 times / minute

Squeeze: The Liquidity Trap

Four phases: quiet accumulation, liquidity removal, stop-order cascade, profit taking. Real-time detection is possible by identifying the transition from Phase 1 to Phase 2.

Iceberg / Hidden Accumulation: The Silent Build-Up

Hidden accumulation fingerprint:
  visible_size:        small (10-50 lots)
  refill_speed:        instant (< 100ms after fill)
  refill_count:        > 20 at a single level per session
  price_reaction:      price does not move despite volume

Wash Trading: Self-Dealing

Two or more addresses from the same cluster simultaneously sit on bid and ask and execute against each other. Purpose: inflating volume.

Part 6: Implementation in Marketmaker.cc

Behavioral Fingerprint System Architecture

┌─────────────────────────────────────────────────────────────┐
│  DATA LAYER                                                 │
│  ├── Hyperliquid Node  → L3 order-by-order + wallet IDs    │
│  ├── CEX WebSocket     → L2 depth + trades                 │
│  └── Historical Store  → QuestDB / Parquet                 │
├─────────────────────────────────────────────────────────────┤
│  FEATURE EXTRACTION                                         │
│  ├── Timing Engine     → Hawkes fit, IOI distribution       │
│  ├── Sizing Engine     → Size stats, round-lot detection    │
│  ├── Placement Engine  → Offset calc, symmetry analysis     │
│  ├── Reaction Engine   → Post-fill tracker, vol response    │
│  └── Lifecycle Engine  → Lifetime stats, cancel patterns    │
├─────────────────────────────────────────────────────────────┤
│  CLASSIFICATION / IDENTIFICATION                            │
│  ├── Online Classifier → Real-time participant tagging      │
│  ├── Cluster Engine    → Address clustering (DEX)           │
│  ├── Similarity Search → Fingerprint matching               │
│  └── Anomaly Detector  → Spoof / squeeze / wash detection   │
├─────────────────────────────────────────────────────────────┤
│  VISUALIZATION                                              │
│  ├── Queue Position + Participant Labels                    │
│  ├── Cluster Graph (known MM clusters)                      │
│  ├── Alert System (manipulation detected)                   │
│  └── Historical Fingerprint Browser                         │
└─────────────────────────────────────────────────────────────┘

What the Trader Sees in the Terminal

╔═══════════════════════════════════════════════════════════════════════════╗
║  10000 USDT  │  Total: 3,200                                           ║
║──────────────┼───────────────────────────────────────────────────────── ║
║  Breakdown:  │  🤖 MM (cluster#7, ~Wintermute): 800 lots [5 levels]   ║
║              │  🤖 MM (cluster#12, unknown):     400 lots [3 levels]   ║
║              │  ⚠️ Suspicious (spoof score 87):  500 lots [lifetime<2s]║
║              │  🟡 @pro_scalper:                 100 lots              ║
║              │  🔴 MINE:                         10 lots               ║
║              │  👤 Retail / unclassified:         1,390 lots           ║
║──────────────┼───────────────────────────────────────────────────────── ║
║  Queue ahead │  Real: ~1,200 (excl. spoof)       Nominal: 1,800       ║
║  "Clean" ETA │  6.7s (vs nominal 10s)                                  ║
║  Wall type:  │  MM-backed (65% MM volume) — likely to hold             ║
╚═══════════════════════════════════════════════════════════════════════════╝

Alerts

🔴 Spoof detected at your price level — a suspicious block has been identified in front of your order
🟡 MM withdrawal — a market maker has pulled quotes, liquidity has dropped
🟡 Squeeze setup detected — a cluster of addresses is accumulating a position and removing liquidity
🟢 Wall reinforced — volume from a known MM has been added at your level

Part 7: Ethics and Limitations

What Is and Isn't Acceptable

Acceptable and encouraged:

Classifying anonymous participants by type for your own trading decisions
Detecting manipulation to protect against adverse selection
Clustering addresses on DEX to understand market structure

Not acceptable:

Deanonymizing individuals by wallet addresses
Selling identified patterns without consent
Using data for market manipulation

Limitations

Adaptive algorithms — advanced MMs add randomization
Regime switching — a single bot can change behavior depending on the market regime
False positives — two participants may coincidentally share similar parameters
CEX opacity — L3 data is unavailable on CEX

Conclusion: From Reading the Book to Reading the Participants

A traditional trader sees: 2,400 lots at the 10000 level. An advanced trader sees: "my order is 1,800th in the queue, ETA — 15 seconds." But a trader with behavioral fingerprinting sees:

Each new layer of information is an edge. And unlike speed, the quality of order book interpretation is a domain where a retail trader can compete.

At Marketmaker.cc, we are building this system — from queue position to behavioral fingerprinting — as a unified product.

Previous article in the series: "Queue Inside the Wall: Analyzing Order Position in Order Book Density"

References and Further Reading

Kirilenko A., Kyle A., Samadi M., Tuzun T. — "The Flash Crash: High-Frequency Trading in an Electronic Market" (CFTC, 2011)
Paddrik M., Hayes R., Scherer W., Beling P. — "Gaussian Process-Based Algorithmic Trading Strategy Identification" (CFTC / OFR, 2014)
Cont R. et al. — "Unsupervised spectral clustering of trader order flow" (2023)
FIDR-SCAN — "Explainable Machine Learning for HFT Dynamics Discovery" (2024)
Do B.L., Putniņš T.J. — "Detecting Layering and Spoofing in Markets" (SSRN, 2023)
Hawkes A.G. — "Spectra of Some Self-Exciting and Mutually Exciting Point Processes" (Biometrika, 1971)
Avellaneda M., Stoikov S. — "High-Frequency Trading in a Limit Order Book" (Quantitative Finance, 2008)

Introduction: The Order Book as a Crime Scene

Part 1: What Is a Behavioral Fingerprint

Every Algorithm Is a Set of Rules

Five Dimensions of the Fingerprint

Part 2: Feature Extraction

Timing: When and How Often

Sizing: How Much and How

Placement: Where in the Book

Reaction: How It Responds to Events

Lifecycle: The Order's Life Cycle

Part 3: Participant Classification

Taxonomy of Market Participants

Classification Methods

Part 4: Identifying Specific Market Makers

From Classification to Identification

Building the Fingerprint Vector

Address Clustering: "One Operator — N Wallets"

Part 5: Manipulation Scenarios and Their Fingerprints

Spoofing: The Fingerprint of a Fake Wall

Squeeze: The Liquidity Trap

Iceberg / Hidden Accumulation: The Silent Build-Up

Wash Trading: Self-Dealing

Part 6: Implementation in Marketmaker.cc

Behavioral Fingerprint System Architecture

What the Trader Sees in the Terminal

Alerts

Part 7: Ethics and Limitations

What Is and Isn't Acceptable

Limitations

Conclusion: From Reading the Book to Reading the Participants

References and Further Reading

Read More

Queue Inside the Wall: Analyzing Order Position in Order Book Density

QuestDB for Algorithmic Trading: From Order Books to Production Architecture

Anomaly Detection for Trading Bot Protection: From Z-Score to Transformer

시장에서 앞서 나가세요

Introduction: The Order Book as a Crime Scene

Part 1: What Is a Behavioral Fingerprint

Every Algorithm Is a Set of Rules

Five Dimensions of the Fingerprint

Part 2: Feature Extraction

Timing: When and How Often

Sizing: How Much and How

Placement: Where in the Book

Reaction: How It Responds to Events

Lifecycle: The Order's Life Cycle

Part 3: Participant Classification

Taxonomy of Market Participants

Classification Methods

Part 4: Identifying Specific Market Makers

From Classification to Identification

Building the Fingerprint Vector

Address Clustering: "One Operator — N Wallets"

Part 5: Manipulation Scenarios and Their Fingerprints

Spoofing: The Fingerprint of a Fake Wall

Squeeze: The Liquidity Trap

Iceberg / Hidden Accumulation: The Silent Build-Up

Wash Trading: Self-Dealing

Part 6: Implementation in Marketmaker.cc

Behavioral Fingerprint System Architecture

What the Trader Sees in the Terminal

Alerts

Part 7: Ethics and Limitations

What Is and Isn't Acceptable

Limitations

Conclusion: From Reading the Book to Reading the Participants

References and Further Reading