Speculative Decoding

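Speculative decoding speeds up autoregressive generation: a small, cheap draft model proposes several tokens ahead, and the larger target model verifies them in a single pass, keeping the longest prefix it agrees with.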

Run the example with:

cargo run --example speculative_decode
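
For orientation, the draft-then-verify loop can be sketched as below. This is a minimal illustration of the greedy variant (a drafted token is accepted only if it equals the target's own choice), not the code behind the example above. The names `Model`, `target_model`, `draft_model`, and `speculative_decode`, and the toy token rules inside the two models, are all hypothetical. A real implementation would verify all drafted positions in one batched forward pass of the target model; the sketch calls the target once per position for clarity.

```rust
/// Hypothetical toy "model": maps a token context to its greedy next token.
type Model = fn(&[u32]) -> u32;

/// Hypothetical target model: the next token is (last + 1) mod 10.
fn target_model(ctx: &[u32]) -> u32 {
    (ctx.last().copied().unwrap_or(0) + 1) % 10
}

/// Hypothetical draft model: agrees with the target except after tokens 3 and 7.
fn draft_model(ctx: &[u32]) -> u32 {
    let last = ctx.last().copied().unwrap_or(0);
    if last % 4 == 3 { (last + 2) % 10 } else { (last + 1) % 10 }
}

/// Greedy speculative decoding (assumed variant: exact-match acceptance).
/// Returns the full sequence and the number of verification rounds.
fn speculative_decode(
    prompt: &[u32],
    draft: Model,
    target: Model,
    k: usize,
    max_new: usize,
) -> (Vec<u32>, usize) {
    let mut seq = prompt.to_vec();
    let goal = prompt.len() + max_new;
    let mut rounds = 0;

    while seq.len() < goal {
        // 1. Draft k tokens autoregressively with the cheap model.
        let mut proposal = seq.clone();
        for _ in 0..k {
            let t = draft(&proposal);
            proposal.push(t);
        }

        // 2. Verify the drafted tokens against the target model.
        //    (One batched pass in a real system; a loop in this sketch.)
        rounds += 1;
        let base = seq.len();
        let mut accepted_all = true;
        for i in 0..k {
            let expected = target(&proposal[..base + i]);
            let ok = proposal[base + i] == expected;
            // Always keep the target's token: on a mismatch it acts as the correction.
            seq.push(expected);
            if !ok {
                accepted_all = false;
                break;
            }
            if seq.len() >= goal {
                break;
            }
        }

        // 3. If every draft was accepted, the target yields one extra token.
        if accepted_all && seq.len() < goal {
            seq.push(target(&seq));
        }
    }
    (seq, rounds)
}
```

Because every kept token is the target model's own greedy choice, the output matches plain greedy decoding with the target alone; the saving is in the number of verification rounds. With these toy models, `speculative_decode(&[0], draft_model, target_model, 4, 12)` produces 12 new tokens in 3 rounds.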