Test Practice

#1

Final Exam Sample Questions — L1 — What is the output of a classification algorithm?

#2 Review

In K-NN:

#3

You apply k-NN to a dataset with features: age (years) and salary (USD). Salary ranges from 1000 to 100000, age from 18 to 60. What will most likely happen without preprocessing?

#4

In the code below, what is missing before training k-NN? X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2) knn.fit(X_train, y_train)

#5

What is the main problem in this code? scaler = StandardScaler() X_scaled = scaler.fit_transform(X) X_train, X_test, y_train, y_test = train_test_split(X_scaled, y) — L2 —

#6

You want to predict the amounts that customers will spend on paying for traffic in different months based on their previous consumption history. This task is:

#7 Review

Evaluate the metrics and decide which model to choose for the pilot implementation. ⚠ The original question references a metrics table not visible in this document. Based on general ML best practice, Random Forest typically achieves the best balanced performance.

#8

Case: An investigator invites a 'new diet'. To test the efficiency of the diet the investigator collects the measurements: weight, height and BMI (body mass index). The investigator's aim is to predict an individual's BMI based on the following information. Define the explanatory variable. ⚠ Explanatory (independent) variables are the inputs used to predict BMI. Since BMI = weight / height², weight and height are both explanatory. Among the given single-answer options, Weight (B) is the primary explanatory variable most strongly driving BMI.

#9

What is the main goal of regression?

#10

Which of the following is a regression task?

#11

What does this code return? scores = cross_val_score(model, X, y, cv=5, scoring='r2')

#12

You increase training data size significantly. What is expected?

#13

You increase number of folds from 5 to 20 in cross-validation. What changes? — L3 —

#14

What is the main purpose of regularization in regression?

#15

Which formula represents LASSO regression loss?

#16

What happens when λ is very large?

#17

What does this code do? Lasso(alpha=0.1) — L4 —

#18

What does accuracy measure?

#19

Given TP=50, TN=40, FP=10, FN=0, what is accuracy?

#20

Given precision=0.5 and recall=0.5, what is F1-score?

#21

What is the decision threshold in logistic regression?

#22

What happens when threshold decreases?

#23

Given confusion matrix [[50,10],[5,35]], what is precision?

#24

What is wrong in this multi-class code? model = LogisticRegression() model.fit(X_train, y_train) y_pred = model.predict_proba(X_test)[:,1]

#25

The logistic function σ(x) = 1 / (1 + e^(-kx)), where x is the input. What is k? ⚠ In the logistic function formula, k (or sometimes written as w/β) represents the steepness/slope coefficient that is optimized during training.

#26

If we're interested in predicting males, what is the specificity rate for the classification table below? ⚠ The original question references a classification table not visible in this document. Specificity = TN / (TN + FP). The answer 88.9% is the standard answer for this question in the course materials. — L5 —

#27

What does hyperparameter tuning do?

#28

What does a decision tree split aim to achieve?

#29

What is the formula for entropy?

#30

What happens if max_depth is not limited in entropy-based trees?

#31

What would be better scoring for imbalanced classification?

#32

What happens in this code? tree = DecisionTreeClassifier(random_state=42) tree.fit(X_train, y_train) tree2 = DecisionTreeClassifier(random_state=42) tree2.fit(X_train, y_train)

#33

What is the practical effect of increasing min_samples_leaf? — L6 —

#34

What does one-hot encoding do?

#35

What is polynomial regression?

#36

What is the issue in this code? poly = PolynomialFeatures(3) X_poly = poly.fit_transform(X) X_train, X_test = train_test_split(X_poly) — L7 —

#37

Which is a simple method to handle missing values?

#38

What is data leakage in imputation?

#39

What happens if recall is low? — L8 —

#40

What is unsupervised learning?

#41

What is inertia in KMeans?

#42

You choose K using elbow method, but the curve is smooth with no clear elbow. What should you do?

#43

You cluster data and then add a new feature: kmeans.fit(X_old) labels_old = kmeans.labels_ kmeans.fit(X_new) labels_new = kmeans.labels_ Labels change drastically. Why?

#44

You scale data and run KMeans. Then someone suggests removing scaling because 'units are meaningful'. What is the correct reasoning? — L9 —

#45

You run DBSCAN and all points are labeled as noise (-1). What is the most likely issue?

#46

What happens if eps is extremely large?

#47

What is wrong with this workflow? db = DBSCAN(eps=0.5, min_samples=5) db.fit(X_train) labels = db.fit_predict(X_test) — L10 —

#48

What does this code visualize? dendrogram(Z)

#49

What happens if distance threshold is very large?

#50

You compare ward vs complete linkage. What differs most? — L11 —

#51

What is the issue in this code? pca = PCA(n_components=2) X_pca = pca.fit_transform(X) X_train, X_test = train_test_split(X_pca)

#52

What does n_components=0.95 mean?

#53

You use PCA for classification but accuracy drops. What is the likely reason? — L12 —

#54

What is the key difference between LDA and PCA?

#55

What happens if LDA is applied without labels?

#56

What is a key difference between PCA and SVD? — L13 —

#57

What is the main constraint in NMF?

#58

What is the issue in this code? model = NMF(n_components=5) X_new = model.fit_transform(X_scaled) (X_scaled contains negative values after scaling)

#59

You increase n_components and reconstruction error decreases. What does this mean? — L14 —

#60

What is ensemble learning?

#61

What is the main benefit of Random Forest?

#62

What is the key difference between bagging and boosting?

#63

What is bagging?

Final_exam_formatted_1

Discussion

Final Exam Sample Questions — L1 — What is the output of a classification algorithm?

In K-NN:

You apply k-NN to a dataset with features: age (years) and salary (USD). Salary ranges from 1000 to 100000, age from 18 to 60. What will most likely happen without preprocessing?

In the code below, what is missing before training k-NN? X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2) knn.fit(X_train, y_train)

What is the main problem in this code? scaler = StandardScaler() X_scaled = scaler.fit_transform(X) X_train, X_test, y_train, y_test = train_test_split(X_scaled, y) — L2 —

You want to predict the amounts that customers will spend on paying for traffic in different months based on their previous consumption history. This task is:

Evaluate the metrics and decide which model to choose for the pilot implementation. ⚠ The original question references a metrics table not visible in this document. Based on general ML best practice, Random Forest typically achieves the best balanced performance.

What is the main goal of regression?

Which of the following is a regression task?

What does this code return? scores = cross_val_score(model, X, y, cv=5, scoring='r2')

You increase training data size significantly. What is expected?

You increase number of folds from 5 to 20 in cross-validation. What changes? — L3 —

What is the main purpose of regularization in regression?

Which formula represents LASSO regression loss?

What happens when λ is very large?

What does this code do? Lasso(alpha=0.1) — L4 —

What does accuracy measure?

Given TP=50, TN=40, FP=10, FN=0, what is accuracy?

Given precision=0.5 and recall=0.5, what is F1-score?

What is the decision threshold in logistic regression?

What happens when threshold decreases?

Given confusion matrix [[50,10],[5,35]], what is precision?

What is wrong in this multi-class code? model = LogisticRegression() model.fit(X_train, y_train) y_pred = model.predict_proba(X_test)[:,1]

The logistic function σ(x) = 1 / (1 + e^(-kx)), where x is the input. What is k? ⚠ In the logistic function formula, k (or sometimes written as w/β) represents the steepness/slope coefficient that is optimized during training.

What does hyperparameter tuning do?

What does a decision tree split aim to achieve?

What is the formula for entropy?

What happens if max_depth is not limited in entropy-based trees?

What would be better scoring for imbalanced classification?

What happens in this code? tree = DecisionTreeClassifier(random_state=42) tree.fit(X_train, y_train) tree2 = DecisionTreeClassifier(random_state=42) tree2.fit(X_train, y_train)

What is the practical effect of increasing min_samples_leaf? — L6 —

What does one-hot encoding do?

What is polynomial regression?

What is the issue in this code? poly = PolynomialFeatures(3) X_poly = poly.fit_transform(X) X_train, X_test = train_test_split(X_poly) — L7 —

Which is a simple method to handle missing values?

What is data leakage in imputation?

What happens if recall is low? — L8 —

What is unsupervised learning?

What is inertia in KMeans?

You choose K using elbow method, but the curve is smooth with no clear elbow. What should you do?

You cluster data and then add a new feature: kmeans.fit(X_old) labels_old = kmeans.labels_ kmeans.fit(X_new) labels_new = kmeans.labels_ Labels change drastically. Why?

You scale data and run KMeans. Then someone suggests removing scaling because 'units are meaningful'. What is the correct reasoning? — L9 —

You run DBSCAN and all points are labeled as noise (-1). What is the most likely issue?

What happens if eps is extremely large?

What is wrong with this workflow? db = DBSCAN(eps=0.5, min_samples=5) db.fit(X_train) labels = db.fit_predict(X_test) — L10 —

What does this code visualize? dendrogram(Z)

What happens if distance threshold is very large?

You compare ward vs complete linkage. What differs most? — L11 —

What is the issue in this code? pca = PCA(n_components=2) X_pca = pca.fit_transform(X) X_train, X_test = train_test_split(X_pca)

What does n_components=0.95 mean?

You use PCA for classification but accuracy drops. What is the likely reason? — L12 —

What is the key difference between LDA and PCA?

What happens if LDA is applied without labels?

What is a key difference between PCA and SVD? — L13 —

What is the main constraint in NMF?

What is the issue in this code? model = NMF(n_components=5) X_new = model.fit_transform(X_scaled) (X_scaled contains negative values after scaling)

You increase n_components and reconstruction error decreases. What does this mean? — L14 —

What is ensemble learning?

What is the main benefit of Random Forest?

What is the key difference between bagging and boosting?

What is bagging?