Hi @CatalinaAlbornoz, thank you for your reply. Yes, in most cases, people are making effort on generating more samples to improve model performance. But in my case, the dataset is too large. I have about 100,000 samples with 600 features. It is impractical to train my ML model on this dataset in my local environment. So the requirement of my project is to reduce the number of samples using quantum methods. I notice one interesting example about amplitude encoding in this page, but I don’t know if it could work for my case. I actually post a new question about it in the forum. If you are interested in it, you may take a look.