AlphaCNOT: Learning CNOT Minimization with Model-Based Planning | hypedar