§ 04 · stage 2 · $500k to $5m ARR
Test what already works harder.
By $500K-$1M ARR the highest-traffic page family typically passes 100K monthly sessions and the math starts to support real A/B testing. The right cadence is 2-4 concurrent tests with proper sample-size pre-registration, focused on the templates that already convert (PDP, cart, checkout) rather than the templates that don't have enough traffic to test (low-volume collection pages, blog, account).
The first decision is the testing platform. Optimizely, VWO, AB Tasty, and Convert.com all sit in the $1K-$8K per month band depending on traffic volume; the mid-market choice is usually VWO or Convert because Optimizely's pricing accelerates fast past $30M sessions per year. For Shopify Plus stores, Shopify Functions and Theme Editor split-testing extensions cover most of the surface for free, with the caveat that statistical rigor (sample-size calculator, Bayesian or sequential analysis) usually still wants a third-party tool layered on top. The honest reframe is that the platform is the cheapest part of the program; the expensive part is the engineer-and-analyst time to ship variants and read results properly.
The five-template focus. At this stage the test surface is the PDP, the cart, the checkout, the homepage, and the top three collection pages by traffic. Anything below the top-three collection pages doesn't carry enough sessions to test cleanly inside a calendar quarter. Within those templates, the highest-leverage tests by category are: PDP image-gallery placement and proof-stacking order (reviews above or below price, video above or below image gallery, sticky add-to-cart yes or no), cart-page free-shipping-threshold messaging and cross-sell positioning, checkout payment-method ordering and address-autofill behaviour, homepage hero-product framing (single-product hero vs collection-led hero vs editorial-led hero), and collection-page filter visibility and sort-default behaviour.
Cart-recovery and email-driven CRO. The single highest-ROI work at this stage is usually the abandoned-cart and post-purchase flow architecture, not the on-site test. Klaviyo's benchmark report shows abandoned-cart flows recover 15-30 percent of cart abandons when the timing and content are right (60-minute first send, 24-hour second send with social proof, 72-hour third send with a soft incentive). We shipped a cart-recovery flow rebuild for Big Game Sports that contributed to a +420 percent BFCM revenue lift over their prior year baseline; the work was 30 hours of flow architecture, segmentation, and creative rather than 30 weeks of on-site A/B tests. The brand-side reality is that for most $1M-$5M DTC brands the email channel is a more profitable test surface than the storefront, with faster iteration cycles and cleaner attribution. See our companion piece on Klaviyo + Shopify integration for the full flow architecture.
Subscription cadence as CRO. Subscription-eligible categories (consumables, beauty, food, supplements) carry an under-discussed CRO lever: the subscription opt-in rate at checkout. Recharge data shows subscription opt-in rates running 15-35 percent at checkout when the offer is positioned correctly (default-on with one-click decline beats default-off with two-click opt-in by 11-18 percentage points; "subscribe and save 10%" beats "subscribe" by another 4-7 points). For Emani we shipped the store from $0 to $2M MRR primarily through subscription cadence rather than per-visit CR optimization; the LTV math from a 25 percent subscription opt-in rate dwarfs a 10 percent CR lift on the equivalent traffic. Brands in subscription-eligible categories that haven't built the subscription opt-in flow are leaving 40-60 percent of contribution margin on the table. The companion piece on Recharge + Shopify subscription setup covers the setup mechanics.
Customer-support feedback loops. The shortest path to a meaningful CR lift at this stage is usually the support-ticket archive, not the analytics dashboard. Gorgias, Zendesk, and Help Scout all expose ticket-tag analytics that surface the three or four pre-purchase friction points buyers ask about most. For one DTC apparel client the top pre-purchase tag was sizing uncertainty; the fix (adding a sizing-comparison module to PDP showing fit on three body types) lifted CR on the affected pages by 18 percent. The work was 12 hours of design and development; the insight came from 90 minutes spent reading 200 tagged tickets. The right CRO program at this stage reads support archives weekly.