Answer Fast: Accelerating BERT on the Tensor Streaming Processor