Production Intent Pipeline for B2B Sales Intelligence

Managed pipeline that identifies in-market buyers from LinkedIn engagement signals, scores and enriches contacts against a custom ICP, and delivers qualified leads directly into HubSpot — designed for continuous weekly operation with resumable execution and conservative rate limiting.

Anaïs Ofranc Mar 25, 202613 min read
OrganisationIoT provider
IndustryPropTech

The Challenge

B2B sales development teams spend 11+ hours per rep per week on manual prospecting research with no systematic way to distinguish contacts who are actively in-market from those on a stale lead list. Off-the-shelf intent data platforms cost £2,000–5,000/month and deliver account-level signals requiring human interpretation. The client's target customers (social housing providers) has fragmented providers, inconsistent role titles, and topic-specific regulatory intent signals invisible to standard tools.

The Solution

QualitaX designed and operates a five-stage managed intent pipeline: (1) people search and topic post discovery via SearchAPI.io, (2) engager extraction via PhantomBuster multi-agent orchestration, (3) two-pass profile enrichment minimising API calls, (4) cross-reference and scoring against target organisations with 21 ICP regex patterns, and (5) CRM delivery to HubSpot with six custom intent properties and smart update logic. The system runs continuously with resumable execution, conservative LinkedIn rate limiting, and configurable query budgets.

Overview

This project addresses a structural inefficiency in B2B outbound sales: the gap between intent signal detection and actionable sales intelligence. Enterprise intent data platforms surface account-level buying signals but leave the last mile — identifying the specific person, confirming ICP fit, and routing to the right rep — to manual effort. At the other end, LinkedIn automation tools offer contact finding but limited intelligence bespoke to a given business, no compliance controls, and no operational resilience.

The LinkedIn intent pipeline occupies the gap between these two categories. It is more actionable than intent data platforms (as it delivers named, scored contacts directly into the client's CRM) and more reliable than point automation tools (running unattended with resumable execution, structured progress tracking, and conservative rate limiting at every external API boundary).

The Challenge

B2B sales development teams spend 11+ hours per rep per week on manual prospecting research — scrolling LinkedIn, cross-referencing job titles, guessing at buyer intent — with no systematic way to distinguish contacts who are actively in-market from those on a stale lead list. Off-the-shelf intent data platforms (Bombora, 6sense, Cognism) cost £2,000–5,000/month and deliver account-level signals that still require human interpretation. The alternative — building an internal automation — typically produces a script that works for three weeks before silently breaking, with no monitoring, no error recovery, and no audit trail.

Prospecting into the client's industry (social housing sector) presents a unique set of difficulties that generic sales intelligence tools are poorly equipped to address:

The Solution

QualitaX designed and operates a managed intent pipeline that inverts the traditional prospecting workflow. Instead of finding companies, then finding contacts, then guessing at intent — the pipeline discovers intent signals first (people actively engaging with topic-relevant LinkedIn posts), retrieves their profiles, enriches them with company and role data, cross-references them against the client's target organisation list, scores them by role relevance and engagement depth, and pushes qualified leads directly into HubSpot with full provenance. The system is designed for continuous weekly operation across multiple days, with every step saving progress incrementally and respecting conservative LinkedIn safety limits.

The Approach: Intent-First Architecture

The pipeline inverts the conventional prospecting funnel. Rather than starting with a target account list and searching for contacts, it starts with live engagement signals and filters backward to ICP-matched contacts at verified organisations.

Five-Stage Pipeline

The system executes five discrete stages in sequence, each with independent progress tracking and error handling. Every stage saves state incrementally after each action — a network error, rate limit, or manual interruption at any point loses at most one action's worth of work.

Stage 1a — People Search (SearchAPI.io). Google search queries find LinkedIn profiles of people using four role keywords (asset, compliance, sustainability, carbon). This produces 6,316 queries (1,579 orgs × 4 keywords), executed over multiple days at a configurable daily limit (default: 2,000 queries/day). Each query is structured as site:linkedin.com/in "{Organisation}" "{keyword}" -intitle:hiring -intitle:apply. Progress is saved after every 25 queries.

Stage 1b — Topic Post Discovery (SearchAPI.io). Four curated Google search queries discover recent LinkedIn posts discussing Awaab's Law, SHDF/retrofit funding, tenant satisfaction, and fuel poverty. Each query is restricted to site:linkedin.com/posts with topic-specific keywords, date-filtered to the past month, and paginated up to three pages. This stage typically completes in under a minute with approximately 12 API calls.

Stage 2 — Engager Extraction (PhantomBuster). The Post Commenter and Liker Scraper extracts every person who engaged with the discovered posts. This is the pipeline's core insight: engagement with a topic-relevant post is a direct intent signal. A compliance manager who commented on an Awaab's Law post with specific concerns about damp remediation timelines has publicly demonstrated both domain expertise and active problem awareness — a categorically different signal from a name on a purchased list.

The extraction step is operationally complex. The PhantomBuster Post Commenter and Liker Scraper is a multi-agent orchestrator comprising four internal agents: a master coordinator, a post extractor, a commenter worker, and a liker worker. Processing a single post requires a six-step orchestration sequence: save the post URL to the master config, launch the post extractor and wait for completion, launch the master to process commenters, launch the liker worker, launch the master again to combine results, then download the result CSV from S3 storage. Each post takes 3–5 minutes to process.

Stage 3 — Profile Enrichment (PhantomBuster, Two-Pass). A two-pass strategy minimises expensive LinkedIn Profile Scraper calls. Pass A (free, instant) parses the occupation field returned by the engager scraper — patterns like "Director of Asset Management at ABC" — to extract company names using separator detection (at, -, |). Pass B (PhantomBuster) enriches only the profiles where company extraction failed, at a rate of 40 profiles per day during UK business hours (9am–5pm) with randomised 15–45 second delays between actions. This two-pass approach saves approximately 40–50% of enrichment API calls.

Stage 4 — Cross-Reference and Score (Python). Enriched profiles are matched against a target list using a three-tier fuzzy matching algorithm: exact match, substring match, and suffix-stripped match. Job titles are evaluated against 21 regex patterns covering the client's ICP roles. A composite intent score is calculated:

Stage 5 — CRM Delivery (HubSpot). Qualified leads (score ≥ 10, matched organisation required) are pushed to HubSpot via the CRM API. Six custom contact properties are auto-created on first run: intent_topic (dropdown), intent_last_signal_date (date), intent_interaction_type (dropdown: commented/liked/both), intent_signal_detail (post snippet or comment text, truncated to 200 words), intent_score (number), and intent_source (text label). Smart update logic prevents overwriting existing standard fields and only updates intent scores upward — a contact whose score was 20 last week will not be downgraded to 15 this week.

Operational Controls

The pipeline is built with operational discipline that separates it from typical prospecting automation:

Key Benefits and Results

The pipeline delivers five concrete outcomes that manual prospecting and generic intent platforms cannot match:

Scaling and Operational Outlook

The pipeline is designed for continuous weekly operation. A recommended two-tier architecture combines Tier 1 (the automated intent pipeline, answering "who is actively talking about our topics right now?") with Tier 2 (LinkedIn Sales Navigator for direct prospecting, answering "who are the decision-makers at target org X?"). Sales Navigator handles people-by-role search more effectively than Google scraping, eliminates the 6,316-query people search entirely, and provides native HubSpot integration — while the intent pipeline provides the unique signal that no off-the-shelf tool delivers.