Actor-only Deterministic Policy Gradient via Zeroth-order Gradient Oracles in Action Space