Optimal string clustering based on a Laplace-like mixture and EM algorithm on a set of strings