1. Black-box deliverables
PDF reports or gene lists without scripts, parameter logs, or a runnable environment cannot be verified. Sandve et al. (2013) emphasize archiving software versions and parameters; Wellcome Trust (n.d.) expects replication software at publication. Probe: "Can you deliver a version-locked environment that reproduces every figure?"
2. Methodological metadata only on request
Outsourced data often lacks protocol and software-version details unless the client asks proactively (Sloan & Stenglein, 2025); refusal is a red flag. Probe: "Will chemistry, instrument model, and software versions ship automatically with every deliverable?"
3. No version-pinned compute environment
Without a conda environment.yml or Python requirements.txt, your lab cannot reliably rerun the analysis. Recreating NGS workflows without version metadata can require hundreds of hours (Piccolo & Frampton, 2016). Probe: "Will you provide an environment.yml or requirements.txt that reproduces all outputs?"
4. Vague statement of work or headline pricing
Flat per-sample quotes without milestone deliverables or acceptance criteria invite scope disputes. Low headline prices may exclude figures or reviewer support. Probe: "List exact files at each milestone, with acceptance criteria."
5. Software licensing sold as analysis
Platform licenses are not scientist-led analysis. A UI without a named analyst who understands QC and batch effects is a tool subscription. Probe: "Who interprets QC failures—a named scientist or an account manager?"
6. Data transfer before NDA and security review
Requesting raw FASTQ or BAM before NDA, or inability to describe encrypted transfer (SFTP, S3 with SSE-KMS), isolated compute, and certified deletion, signals unreadiness. Probe: "Will you sign our NDA before upload, and where will files be stored?"
7. Guaranteed significant results or implausible timelines
"Guaranteed differential expression" or manuscript-ready output in days for a large cohort signals sales pressure; rush schedules often skip QA. Probe: "What is typical queue time plus QA depth for our sample size—not your fastest case?"
8. Unclear intellectual property and code ownership
Ambiguous ownership of custom scripts and outputs causes publication problems. Red flags: exclusive pipeline rights, no Git handoff, or blocked export. Probe: "Who owns custom code and processed outputs?"
9. Sales-led scoping with no dedicated scientist
Sales-led discovery with no named scientist, or dismissing reproducibility as "technical details," leaves no accountability when QC fails. Probe: "Who owns QC decisions, and how often will we meet during analysis?"