
Machine Learning Cross-Validation


A typical machine learning use case moves through several pipeline stages:

  • Data collection,
  • Feature engineering,
  • Feature selection,
  • Model creation, and
  • Model deployment.

Before model creation, the first step is usually a train-test split:

  • Suppose the dataset has 1000 records.
  • We split it so that, for example,
    • the training set gets 70% of the data and
    • the test set gets 30%,
  • or 80% train and 20% test,
  • depending on the size of the dataset.

The model is trained only on the 70% portion, and the remaining 30% is used to check its accuracy.

When we do a train-test split, both the 70% training portion and the 30% test portion are selected at random.
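
As a minimal sketch of this step, here is a 70/30 split using scikit-learn's `train_test_split`. The feature matrix `X` and label vector `y` below are hypothetical placeholders standing in for the 1000-record dataset mentioned above.

```python
import numpy as np
from sklearn.model_selection import train_test_split

X = np.random.rand(1000, 5)          # 1000 records, 5 hypothetical features
y = np.random.randint(0, 2, 1000)    # binary labels, for illustration only

# test_size=0.3 gives the 70% / 30% split; rows are chosen at random.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42
)
print(X_train.shape, X_test.shape)   # (700, 5) (300, 5)
```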

Because of this random selection, the kinds of records that end up in the test set may not be represented in the training set.

As a result, the model's measured accuracy can drop.

To address this problem, we use a technique called cross-validation.
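
Here is a minimal sketch of K-Fold cross-validation with scikit-learn, again using a hypothetical dataset and a logistic regression model chosen purely for illustration. With `cv=5`, every record is used for both training and evaluation, so the score no longer depends on a single random split.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X = np.random.rand(1000, 5)          # same hypothetical 1000-record dataset
y = np.random.randint(0, 2, 1000)

model = LogisticRegression(max_iter=1000)

# cv=5 splits the data into 5 folds; each fold acts as the test set once,
# so every record is used for both training and evaluation.
scores = cross_val_score(model, X, y, cv=5, scoring="accuracy")
print(scores)          # per-fold accuracy
print(scores.mean())   # average accuracy across the folds
```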

To learn more about cross-validation, see: https://dslearningthoughts.blogspot.com/2021/10/machine-learning-cross-validation.html