I use the (paid) SIP feed for historical 1 min OHLC data. The data seems pretty much in line with most sources except from time to time there are these random incorrect spikes (see image, top is Alpaca data, bottom is Yahoo).
What’s going on there and is there a fix? This makes the data pretty useless for backtesting because these spikes will almost certainly (incorrectly) trigger stop-loss or take-profits limits.