Learning operators with coupled attention