Actor-critic reinforcement learning for bidding in bilateral negotiation