Analyzing large data sets: rbcL 500 revisited