An application called Crossover Cortex that is our AI agent & web scraping tool.
Crossover Cortex Deployment Specification
Version: 1.3.1
Last Updated: 2025-07-10
Executive Summary
Crossover Cortex is an enterprise-grade automation pipeline for discovering, verifying, enriching, and syncing high-value B2B contacts. It integrates data from 15+ verified enrichment APIs, filters targets using ICP-aligned personas, and exports validated records into CRM and BI systems.
1.1 Core Functionality
End-to-end contact discovery and enrichment
Persona-targeted filtering and scoring
Technographic and intent signal mapping
Role verification and communication profiling
Seamless CRM and analytics sync (Zoho One)
1.2 Strategic Use Cases
Voice-of-Customer (VoC) panel development
Sales pipeline expansion
Market research panel population
Competitive analysis lead extraction
Industry expert profiling
Phase-by-Phase Breakdown
Phase
Name
Purpose
Key Tools/APIs
-1
Vendor Identification
Confirm vendor domain or company
CLI Prompt
0
ICP Discovery & Validation
Load personas, filters, and segments
workflow-config.yaml
1
Account Discovery
Identify target domains
G2, Capterra, BuiltWith, AppExchange
2
Contact Harvest
Scrape names, titles, and emails
Internal scraping + LinkedIn logic
3
Contact Verification
Validate email and role
Clearout, CUFinder, TexAu, Artemis
4
Enrichment Cascade
Multi-source data augmentation
Clay, Databar, Dewey, Hyperbrowser, others
5
Social Profiling
Assess persona fit and communication quality
Firecrawl, TheirStack, conference rosters
6
Zoho CRM Upsert
Insert or update records in CRM
Zoho CRM API
7
Analytics Sync
Push to Zoho Analytics
Zoho Analytics API
Defined Target Personas
Persona
Priority
Included Title Keywords
Decision-Maker
1
Owner, Founder, President, Chief, SVP,
Champion
2
Champion, Lead, Change Agent
Evaluator
3
VP, Head, Senior, Director Architect, Engineer, Admin
End-User
4
Manager, Coordinator, Technician, Analyst
Exclusion Filters
Excluded titles to improve precision:
Sales
Account Executive
BDR / SDR
Consultant
Editor / Reporter
Representative
Recruiter
Contact-Level Enrichment (Enabled)
Clearout – Email verification
Clay – Company and person lookup
CUFinder – Business contact data
Dewey – Industry/product intelligence
FireEnrich – Contact enrichment (internal logic)
Databar – Multi-source enrichment (ContactOut, Findymail, RocketReach)
Hyperbrowser – Identity signals
Firecrawl – Page-level social scraping
TheirStack – Stack intelligence and role detection
TexAu – LinkedIn automation and extraction
Airscale – Contact and firmographic enrichment
FetchFox – Business graph lookups
UseArtemis – Intent and contact matching
FullEnrich – Composite enrichment engine
Company/Technographic Enrichment
BuiltWith – Technology stack detection
Wappalyzer – App/tech identification
Intent Providers
Bombora(configured but disabled)
SimilarTech(configured but disabled)
Crossover Cortex supports a broad range of external data sources for identifying target accounts, enriching their attributes, and discovering relevant personas. These are categorized below for structured integration.
5.1 Software Review & Marketplace Platforms
Used to extract customer and vendor domains, customer testimonials, and deployment signals.
G2
Capterra
TrustRadius
GetApp
Software Advice
FinancesOnline
Product Hunt
Salesforce AppExchange (Partner Directory)
5.2 Technographics & Intent Discovery
For detecting tech stacks, installed products, and purchasing intent.
BuiltWith
Wappalyzer
6sense
Demandbase
Datanyze
Lusha
Cognism
Lead411
5.3 Social, Community & Professional Networks
Used for targeting individual professionals and analyzing communication signals.
LinkedIn (Sales Navigator)
Twitter (keyword tracking + role mentions)
GitHub (contributor enrichment)
Crunchbase (firmographic data + funding rounds)
5.4 Event & Conference Data
Used to identify thought leaders, decision-makers, and active companies.
Eventbrite (event attendee profiles)
Meetup (industry group memberships)
Conference Websites (speaker rosters, panelists)
5.5 Job & Growth Signal Discovery
Reveal hiring priorities, team expansions, and relevant departmental changes.
LinkedIn Jobs
Indeed
Glassdoor
Company Career Pages (scraped or parsed)
File-Based Exports
./exports/contacts.csv
./exports/contacts.xlsx
CRM Integration
Zoho CRM via OAuth + Refresh Token
Uses company and contact deduplication logic
BI/Analytics Sync
Zoho Analytics via API
Target: EnrichedContacts table in configured workspace
Core Settings
data_source_policy: "public_and_private" preferred_min_role_tenure_months: 6 preferred_max_inactivity_months: 12 preferred_linkedin_profile: true
Batch & Retry
scraper: max_concurrency: 50 max_rate_per_domain: 10/s retry: max_attempts: 3 backoff_factor: 2
processing: batch_size: 500 manual_qc_threshold: 2000 qc_sample_size: 100
Schedule
schedule: cron: "0 3 * * *" # Runs daily at 3 AM EST timezone: "America/New_York"
A .env file should be created at the project root with the following keys:
OPENAI_API_KEY=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
CLEAROUT_API_KEY=8c3585f0e831f1798eda4a9aa6cb4c58:efff9fc2c2e7d5d35f4661d52eb05e79ef96684819235a44b2f5af950b68f3fb
CLAY_API_KEY=9e4fdbdd47ed11d0d150
CUFINDER_API_KEY=1IQK6UACx27rbRLZySyUSLZhvNohYdY2HgOcTMFH
DEWEY_API_KEY=jH15IJRa.1w5VO04MtIQPUCZRywuIH6wqRpdyP9tbkTpZLfBUqSihM5BJiDbUXoFi
DATABAR_API_KEY=HeOjzIdXuFR4kMYZ3r7mvbw5CinsGlKp29UWN6PJo8QaxgDT
HYPERBROWSER_API_KEY=hb_a4a4a8dd73c0c2dc992b3d820ce9b
FIRECRAWL_API_KEY=fc-1dce2fbef79b4ab09b57fd1fd64ebc4b
THEIRSTACK_API_KEY=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJzdWIiOiJicmFkQGNyb3Nzb3ZlcnJlc2VhcmNoLmNvbSIsInBlcm1pc3Npb25zIjoidXNlciJ9.Dd2V2JhQ9IGWZtYr_W5FkFhZCq1kWy1rcArfVVelnCo
TEXAU_API_KEY=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJ1c2VyIjp7InN1YiI6IjFiNWJkZDQyLTZlMDItNGZkMC05ODA1LWJiOTQ3YzhlMGEyMyIsImVtYWlsIjoiYnJhZEBjcm9zc292ZXJyZXNlYXJjaC5jb20iLCJ1c2VySWQiOiI2ODQyNjUyYTA2ZjkzMjA1ZWNiMzNlZTUiLCJvcmdhbmlzYXRpb25JZCI6IjY4NDI2NTJjMDZmOTMyMDVlY2IzM2Y2ZCJ9LCJpYXQiOjE3NTEwOTc5ODl9.uicXTWeLXzIqrTt6WMKK4Bz6TiR2H8_8P0x3u3Uh5rw
TEXAU_AUTOMATION_ID=6842652c06f93205ecb33f7b
AIRSCALE_API_KEY=uRyF8AVmNrnzFrvDzv8XLKp8oOvUj5
FETCHFOX_API_KEY=ff_yz9ntmcnutouvmwzeer0hh6lz6rjahftn0mmbv9a
USE_ARTEMIS_API_KEY=ff_yz9ntmcnutouvmwzeer0hh6lz6rjahftn0mmbv9a
FULLENRICH_API_KEY=b27d7d22-5113-4f1a-9581-58e04b0d7bd2
BUILTWITH_API_KEY=loAPGPTaBH9grrpfJHXxt6Cx14luZrjv3Q6mdyCv
WAPPALYZER_API_KEY=loAPGPTaBH9grrpfJHXxt6Cx14luZrjv3Q6mdyCv
WAPPALYZER_CALLBACK_SECRET=sec_218735700fab2b34c037
BOMBORA_API_KEY=
SIMILARTECH_API_KEY=
ZOHO_CLIENT_ID=1000.3TX3I7MVA66S2N3U9QLKLYO1QNGQUS
ZOHO_CLIENT_SECRET=f42173b74d466ec78f2d710e983cd8c19fdb924b90
ZOHO_REDIRECT_URI=https://yourapp.com/oauth/callback
ZOHO_REFRESH_TOKEN=<your_zoho_refresh_token>
ZOHO_ANALYTICS_WORKSPACE_ID=<your_analytics_workspace_id>
ZOHO_ANALYTICS_DB_NAME=<your_analytics_db_name>
Sampling Controls: QC applied to every 2,000-contact run
Region Locking: Supports region-specific filters for EU, NA, etc.
cortex-project/ ├── workflow-config.yaml ├── enrichors.py ├── run_workflow.py ├── exports/ │ ├── contacts.csv │ └── contacts.xlsx ├── logs/ │ └── workflow.log └── .env # (not checked into version control)
Loading...
Loading...