Evaluating Large Language Models in Theory of Mind Tasks