What are the characteristics of data? Compare the difference in each of the following clustering types: prototype-based, density-based, graph-based. What is a scalable clustering algorithm? How do you choose the right algorithm? need 300 words and references