Objective

Deliver a clean, de‑duplicated dataset of therapeutic infusion locations for SolisRx. The dataset will exclude retail IV‑vitamin/hydration "drip bars," and will tag medical site types relevant to care delivery and analytics.

Scope

Include & Tag

  • Ambulatory Infusion Centers
  • In‑Practice Infusion Suites inside rheum/GI/neuro groups In‑Practice Infusion Suites
  • HOPD — Hospital Outpatient Infusion Departments HOPD

Exclude (For Now)

  • Home‑infusion–only providers (no on‑site chairs)
  • Oncology/Chemo‑Only Infusion Suites Oncology Infusion Suites

Data Sources

NICA Infusion Center Finder

Seed for AIC/AIS/HOPD locations

Google Maps/Places

Inclusive/exclusive search terms

Official Location Pages

AIC/AIS/HOPD websites and network location lists

Approach

Phase 1 — Broad Harvest (Maximize Recall)

Gather locations from sources using inclusive terms around "infusion," "infusion services/center," and "outpatient/ambulatory infusion." Capture raw attributes and page text for later filtering.

Phase 2 — Rule-Based Filtering & Scoring (Maximize Precision)

Apply inclusion/exclusion rules and tag site types. Compute a likelihood score (0–100)for "therapeutic infusion" and store reason codes for every decision.

Key Data Fields

Location & Contact

  • Facility name & full address
  • Lat/long & Google Place ID
  • Phone & website URL
  • Hours by day (Mon–Sun)

Analytics & Scoring

  • Review count & average rating
  • Site type tags & parent company
  • Therapy/med mentions
  • Likelihood score (0–100) + reason codes

Deliverables

1Raw Dataset — Broad harvest data in CSV or Parquet format
2Filtered Dataset — Rule-based precision with site-type tags, scores, and reason codes
3Data Dictionary — Complete field descriptions, tags, and scoring logic
4QA Summary — Precision/recall notes, de-duplication, and edge cases