Can Large Language Models Automatically Score Proficiency of Written Essays?