Submitted by Fengji Zhang
A^2Search: Ambiguity-Aware Question Answering with Reinforcement Learning
City University of Hong Kong