<p>An input tangled sequence such as an instruction stream is compressed by modeling the sequence into multiple Markov chains to simultaneously capture and exploit the correlations among and within subsequences that are mingled in the input tangled sequence. The multiple Markov chains may be combined to drive an entropy coder.</p>