[2024] New AIP-210 exam Free Sample Questions to Practice [Q34-Q56]

[2024] New AIP-210 exam Free Sample Questions to Practice

Cover Real AIP-210 Exam Questions Make Sure You 100% Pass

NEW QUESTION # 34
Which of the following equations best represent an LI norm?

A. |x| + |y|
B. |x|-|y|
C. |x|^2+|y|^2
D. |x|+|y|^2

Answer: A

Explanation:
Explanation
An L1 norm is a measure of distance or magnitude that is defined as the sum of the absolute values of the components of a vector. For example, if x and y are two components of a vector, then the L1 norm of that vector is |x| + |y|. The L1 norm is also known as the Manhattan distance or the taxicab distance, as it represents the shortest path between two points in a grid-like city.

NEW QUESTION # 35
What is Word2vec?

A. A word embedding method that builds a one-hot encoded matrix from samples and the terms that appear in them.
B. A word embedding method that finds characteristics of words in a very large number of documents.
C. A bag of words.
D. A matrix of how frequently words appear in a group of documents.

Answer: B

Explanation:
Explanation
Word2vec is a word embedding method that finds characteristics of words in a very large number of documents. Word embedding is a technique that converts words into numerical vectors that represent their meaning, usage, or context. Word2vec learns a dense and continuous vector representation for each word based on its context in a large corpus of text. Word2vec can capture the semantic and syntactic similarity and relationships among words, such as synonyms, antonyms, analogies, or associations1.

NEW QUESTION # 36
Below are three tables: Employees, Departments, and Directors.
Employee_Table

Department_Table

Director_Table
ID
Firstname
Lastname
Age
Salary
DeptJD
4566
Joey
Morin
62
$ 122,000
1
1230
Sam
Clarck
43
$ 95,670
2
9077
Lola
Russell
54
$ 165,700
3
1346
Lily
Cotton
46
$ 156,000
4
2088
Beckett
Good
52
$ 165,000
5
Which SQL query provides the Directors' Firstname, Lastname, the name of their departments, and the average employee's salary?

A. SELECT m.Firstname, m.Lastname, d.Name, AVG(e.Salary) as Dept_avg_Salary FROM Employee_Table as e RIGHT JOIN Departmentjable as d on e.Dept = d.Name INNER JOIN Directorjable as m on d.ID = m.DeptJD GROUP BY d.Name
B. SELECT m.Firstname, m.Lastname, d.Name, AVG(e.Salary) as Dept_avg_Salary FROM Employee_Table as e RIGHT JOIN Department_Table as d on e.Dept = d.Name INNER JOIN Directorjable as m on d.ID = m.DeptJD GROUP BY e.Salary
C. SELECT m.Firstname, m.Lastname, d.Name, AVG(e.Saiary) as Dept_avg_Saiary FROM Employee_Table as e LEFT JOIN Department_Table as d on e.Dept = d.Name LEFT JOIN Directorjable as m on d.ID = m.DeptJD GROUP BY m.Firstname, m.Lastname, d.Name
D. SELECT m.Firstname, m.Lastname, d.Name, AVG(e.Salary) as Dept_avg_Salary FROM Employee_Table as e RIGHT JOIN Department_Table as d on e.Dept = d.Name INNER JOIN Directorjable as m on d.ID = m.DeptID GROUP BY m.Firstname, m.Lastname, d.Name

Answer: D

Explanation:
Explanation
This SQL query provides the Directors' Firstname, Lastname, the name of their departments, and the average employee's salary by joining the three tables using the appropriate join types and conditions. The RIGHT JOIN between Employee_Table and Department_Table ensures that all departments are included in the result, even if they have no employees. The INNER JOIN between Department_Table and Directorjable ensures that only departments with directors are included in the result. The GROUP BY clause groups the result by the directors' names and departments' names, and calculates the average salary for each group using the AVG function. References: SQL Joins - W3Schools, SQL GROUP BY Statement - W3Schools

NEW QUESTION # 37
Normalization is the transformation of features:

A. Into the normal distribution.
B. So that they are on a similar scale.
C. By subtracting from the mean and dividing by the standard deviation.
D. To different scales from each other.

Answer: B

Explanation:
Explanation
Normalization is the transformation of features so that they are on a similar scale, usually between 0 and 1 or
-1 and 1. This can help reduce the influence of outliers and improve the performance of some machine learning algorithms that are sensitive to the scale of the features, such as gradient descent, k-means, or k-nearest neighbors. References: [Feature scaling - Wikipedia], [Normalization vs Standardization - Quantitative analysis]

NEW QUESTION # 38
Which of the following items should be included in a handover to the end user to enable them to use and run a trained model on their own system? (Select three.)

A. Intermediate data files
B. Information on the folder structure in your local machine
C. Sample input and output data files
D. Link to a GitHub repository of the codebase
E. README document

Answer: C,D,E

Explanation:
Explanation
A handover is the process of transferring the ownership and responsibility of an ML system from one party to another, such as from the developers to the end users. A handover should include all the necessary information and resources that enable the end users to use and run a trained model on their own system. Some of the items that should be included in a handover are:
Link to a GitHub repository of the codebase: A GitHub repository is an online platform that hosts the source code and version control of an ML system. A link to a GitHub repository can provide the end users with access to the latest and most updated version of the codebase, as well as the history and documentation of the changes made to the code.
README document: A README document is a text file that provides an overview and instructions for an ML system. A README document can include information such as the purpose, features, requirements, installation, usage, testing, troubleshooting, and license of the system.
Sample input and output data files: Sample input and output data files are data files that contain examples of valid inputs and expected outputs for an ML system. Sample input and output data files can help the end users understand how to use and run the system, as well as verify its functionality and performance.

NEW QUESTION # 39
When should the model be retrained in the ML pipeline?

A. Some outliers are detected in live data.
B. A new monitoring component is added.
C. Concept drift is detected in the pipeline.
D. More data become available for the training phase.

Answer: C

Explanation:
Explanation
When concept drift is detected in the pipeline, it means that the model performance has degraded over time due to changes in the underlying data generating process. This requires retraining the model with new data that reflects the current situation and updating the model parameters accordingly. References: Use pipeline parameters to retrain models in the designer - Azure Machine Learning | Microsoft Learn, Retraining Model During Deployment: Continuous Training and Continuous Testing

NEW QUESTION # 40
Which of the following unsupervised learning models can a bank use for fraud detection?

A. Hierarchical clustering
B. Anomaly detection
C. k-means
D. DB5CAN

Answer: B

Explanation:
Explanation
Anomaly detection is an unsupervised learning technique that identifies outliers or abnormal patterns in data, which can be useful for fraud detection. Anomaly detection algorithms can learn the normal behavior of transactions and flag the ones that deviate significantly from the norm, indicating possible fraud.

NEW QUESTION # 41
R-squared is a statistical measure that:

A. Combines precision and recall of a classifier into a single metric by taking their harmonic mean.
B. Is the proportion of the variance for a dependent variable thaf' s explained by independent variables.
C. Expresses the extent to which two variables are linearly related.
D. Represents the extent to which two random variables vary together.

Answer: B

Explanation:
Explanation
R-squared is a statistical measure that indicates how well a regression model fits the data. R-squared is calculated by dividing the explained variance by the total variance. The explained variance is the amount of variation in the dependent variable that can be attributed to the independent variables. The total variance is the amount of variation in the dependent variable that can be observed in the data. R-squared ranges from 0 to 1, where 0 means no fit and 1 means perfect fit.

NEW QUESTION # 42
Which of the following is NOT an activation function?

A. Hyperbolic tangent
B. ReLU
C. Sigmoid
D. Additive

Answer: D

Explanation:
Explanation
An activation function is a function that determines the output of a neuron in a neural network based on its input. An activation function can introduce non-linearity into a neural network, which allows it to model complex and non-linear relationships between inputs and outputs. Some of the common activation functions are:
Sigmoid: A sigmoid function is a function that maps any real value to a value between 0 and 1. It has an S-shaped curve and is often used for binary classification or probability estimation.
Hyperbolic tangent: A hyperbolic tangent function is a function that maps any real value to a value between -1 and 1. It has a similar shape to the sigmoid function but is symmetric around the origin. It is often used for regression or classification problems.
ReLU: A ReLU (rectified linear unit) function is a function that maps any negative value to 0 and any positive value to itself. It has a piecewise linear shape and is often used for hidden layers in deep neural networks.
Additive is not an activation function, but rather a term that describes a property of some functions. Additive functions are functions that satisfy the condition f(x+y) = f(x) + f(y) for any x and y. Additive functions are linear functions, which means they have a constant slope and do not introduce non-linearity.

NEW QUESTION # 43
Workflow design patterns for the machine learning pipelines:

A. Separate inputs from features.
B. Represent a pipeline with directed acyclic graph (DAG).
C. Seek to simplify the management of machine learning features.
D. Aim to explain how the machine learning model works.

Answer: B

Explanation:
Explanation
Workflow design patterns for machine learning pipelines are common solutions to recurring problems in building and managing machine learning workflows. One of these patterns is to represent a pipeline with a directed acyclic graph (DAG), which is a graph that consists of nodes and edges, where each node represents a step or task in the pipeline, and each edge represents a dependency or order between the tasks. A DAG has no cycles, meaning there is no way to start at one node and return to it by following the edges. A DAG can help visualize and organize the pipeline, as well as facilitate parallel execution, fault tolerance, and reproducibility.

NEW QUESTION # 44
Which of the following text vectorization methods is appropriate and correctly defined for an English-to-Spanish translation machine?

A. Using Word2vec because in translation machines, we do not care about the order of the words.
B. Using TF-IDF because in translation machines, we need to consider the order of the words.
C. Using TF-IDF because in translation machines, we do not care about the order of the words.
D. Using Word2vec because in translation machines, we need to consider the order of the words.

Answer: D

Explanation:
Explanation
Text vectorization is a technique that converts text into numerical vectors that can be used by machine learning models. Text vectorization can use different methods to represent text features, such as word frequency, word order, word meaning, or word context. Some of the common text vectorization methods are:
TF-IDF: TF-IDF (term frequency-inverse document frequency) is a method that assigns a weight to each word based on its frequency in a document and its rarity across a collection of documents. TF-IDF can capture the importance and relevance of words for a given topic or domain, but it does not consider the order or meaning of words.
Word2vec: Word2vec is a method that learns a vector representation for each word based on its context in a large corpus of text. Word2vec can capture the semantic and syntactic similarity and relationships among words, as well as preserve the order of words.
For an English-to-Spanish translation machine, using Word2vec would be appropriate and correctly defined, because in translation machines, we need to consider the order of the words, as well as their meaning and context.

NEW QUESTION # 45
You create a prediction model with 96% accuracy. While the model's true positive rate (TPR) is performing well at 99%, the true negative rate (TNR) is only 50%. Your supervisor tells you that the TNR needs to be higher, even if it decreases the TPR. Upon further inspection, you notice that the vast majority of your data is truly positive.
What method could help address your issue?

A. Normalization
B. Oversampling
C. Quality filtering
D. Principal components analysis

Answer: B

Explanation:
Explanation
Oversampling is a method that can help address the issue of imbalanced data, which is when one class is much more frequent than the other in the dataset. This can cause the model to be biased towards the majority class and have a low true negative rate. Oversampling involves creating synthetic samples of the minority class or replicating existing samples to balance the class distribution. This can help the model learn more from the minority class and improve the true negative rate. References: [Handling imbalanced datasets in machine learning], [Oversampling and undersampling in data analysis - Wikipedia]

NEW QUESTION # 46
Which of the following is a type 1 error in statistical hypothesis testing?

A. The null hypothesis is true and fails to be rejected.
B. The null hypothesis is false and is rejected.
C. The null hypothesis is true, but is rejected.
D. The null hypothesis is false, but fails to be rejected.

Answer: C

Explanation:
Explanation
A type 1 error in statistical hypothesis testing is when the null hypothesis is true, but is rejected. This means that the test falsely concludes that there is a significant difference or effect when there is none. The probability of making a type 1 error is denoted by alpha, which is also known as the significance level of the test. A type 1 error can be reduced by choosing a smaller alpha value, but this may increase the chance of making a type 2 error, which is when the null hypothesis is false but fails to be rejected. References: [Type I and type II errors - Wikipedia], [Type I Error and Type II Error - Statistics How To]

NEW QUESTION # 47
You have a dataset with many features that you are using to classify a dependent variable. Because the sample size is small, you are worried about overfitting. Which algorithm is ideal to prevent overfitting?

A. XGBoost
B. Decision tree
C. Logistic regression
D. Random forest

Answer: D

Explanation:
Explanation
Random forest is an algorithm that is ideal to prevent overfitting when using a dataset with many features and a small sample size. Random forest is an ensemble learning method that combines multiple decision trees to create a more robust and accurate model. Random forest can prevent overfitting by introducing randomness and diversity into the model, such as by using bootstrap sampling (sampling with replacement) to create different subsets of data for each tree, or by using feature selection (choosing a random subset of features) to split each node in a tree.

NEW QUESTION # 48
Which of the following tools would you use to create a natural language processing application?

A. AWS DeepRacer
B. NLTK
C. DeepDream
D. Azure Search

Answer: B

Explanation:
Explanation
NLTK (Natural Language Toolkit) is a Python library that provides a set of tools and resources for natural language processing (NLP). NLP is a branch of AI that deals with analyzing, understanding, and generating natural language texts or speech. NLTK offers modules for various NLP tasks, such as tokenization, stemming, lemmatization, parsing, tagging, chunking, sentiment analysis, named entity recognition, machine translation, text summarization, and more .

NEW QUESTION # 49
An AI practitioner incorporates risk considerations into a deployment plan and decides to log and store historical predictions for potential, future access requests.
Which ethical principle is this an example of?

A. Fairness
B. Transparency
C. Safety
D. Privacy

Answer: B

Explanation:
Explanation
Transparency is an ethical principle that describes the degree to which an AI system can provide clear and understandable information about its inputs, outputs, processes, and decisions. Transparency can help increase trust and confidence among users and stakeholders, as well as enable accountability and responsibility for the system's actions and outcomes. Logging and storing historical predictions for potential, future access requests is an example of transparency, as it can help provide evidence and explanation for the system's recommendations, as well as facilitate auditing and feedback.

NEW QUESTION # 50
Which of the following principles supports building an ML system with a Privacy by Design methodology?

A. Collecting and processing the largest amount of data possible.
B. Understanding, documenting, and displaying data lineage.
C. Utilizing quasi-identifiers and non-unique identifiers, alone or in combination.
D. Avoiding mechanisms to explain and justify automated decisions.

Answer: B

Explanation:
Explanation
Data lineage is the process of tracking the origin, transformation, and usage of data throughout its lifecycle. It helps to ensure data quality, integrity, and provenance. Data lineage also supports the Privacy by Design methodology, which is a framework that aims to embed privacy principles into the design and operation of systems, processes, and products that involve personal data. By understanding, documenting, and displaying data lineage, an ML system can demonstrate how it collects, processes, stores, and deletes personal data in a transparent and accountable manner3 .

NEW QUESTION # 51
Which of the following metrics is being captured when performing principal component analysis?

A. Missingness
B. Variance
C. Skewness
D. Kurtosis

Answer: B

Explanation:
Explanation
Principal component analysis (PCA) is a technique that reduces the dimensionality of a dataset by transforming it into a set of new variables called principal components. The principal components are linear combinations of the original variables that capture the maximum amount of variance in the data. The first principal component explains the most variance, the second principal component explains the second most variance, and so on. The goal of PCA is to retain as much variance as possible while reducing the number of variables.

NEW QUESTION # 52
Which of the following algorithms is an example of unsupervised learning?

A. Random forest
B. Principal components analysis
C. Neural networks
D. Ridge regression

Answer: B

Explanation:
Explanation
Unsupervised learning is a type of machine learning that involves finding patterns or structures in unlabeled data without any predefined outcome or feedback. Unsupervised learning can be used for various tasks, such as clustering, dimensionality reduction, anomaly detection, or association rule mining. Some of the common algorithms for unsupervised learning are:
Principal components analysis: Principal components analysis (PCA) is a method that reduces the dimensionality of data by transforming it into a new set of orthogonal variables (principal components) that capture the maximum amount of variance in the data. PCA can help simplify and visualize high-dimensional data, as well as remove noise or redundancy from the data.
K-means clustering: K-means clustering is a method that partitions data into k groups (clusters) based on their similarity or distance. K-means clustering can help discover natural or hidden groups in the data, as well as identify outliers or anomalies in the data.
Apriori algorithm: Apriori algorithm is a method that finds frequent itemsets (sets of items that occur together frequently) and association rules (rules that describe how items are related or correlated) in transactional data. Apriori algorithm can help discover patterns or insights in the data, such as customer behavior, preferences, or recommendations.

NEW QUESTION # 53
In general, models that perform their tasks:

A. Less accurately are less robust against adversarial attacks.
B. More accurately are less robust against adversarial attacks.
C. More accurately are neither more nor less robust against adversarial attacks.
D. Less accurately are neither more nor less robust against adversarial attacks.

Answer: B

Explanation:
Explanation
Adversarial attacks are malicious attempts to fool or manipulate machine learning models by adding small perturbations to the input data that are imperceptible to humans but can cause significant changes in the model output. In general, models that perform their tasks more accurately are less robust against adversarial attacks, because they tend to have higher confidence in their predictions and are more sensitive to small changes in the input data. References: [Adversarial machine learning - Wikipedia], [Why Are Machine Learning Models Susceptible to Adversarial Attacks? | by Anirudh Jain | Towards Data Science]

NEW QUESTION # 54
Which of the following options is a correct approach for scheduling model retraining in a weather prediction application?

A. As new resources become available
B. Once a month
C. When the input format changes
D. When the input volume changes

Answer: C

Explanation:
Explanation
The input format is the way that the data is structured, organized, and presented to the model. For example, the input format could be a CSV file, an image file, or a JSON object. The input format can affect how the model interprets and processes the data, and therefore how it makes predictions. When the input format changes, it may require retraining the model to adapt to the new format and ensure its accuracy and reliability. For example, if the weather prediction application switches from using numerical values to categorical values for some features, such as wind direction or cloud cover, it may need to retrain the model to handle these changes
.

NEW QUESTION # 55
Word Embedding describes a task in natural language processing (NLP) where:

A. Words are featurized by taking a histogram of letter counts.
B. Words are converted into numerical vectors.
C. Words are featurized by taking a matrix of bigram counts.
D. Words are grouped together into clusters and then represented by word cluster membership.

Answer: B

Explanation:
Explanation
Word embedding is a task in natural language processing (NLP) where words are converted into numerical vectors that represent their meaning, usage, or context. Word embedding can help reduce the dimensionality and sparsity of text data, as well as enable various operations and comparisons among words based on their vector representations. Some of the common methods for word embedding are:
One-hot encoding: One-hot encoding is a method that assigns a unique binary vector to each word in a vocabulary. The vector has only one element with a value of 1 (the hot bit) and the rest with a value of
0. One-hot encoding can create distinct and orthogonal vectors for each word, but it does not capture any semantic or syntactic information about words.
Word2vec: Word2vec is a method that learns a dense and continuous vector representation for each word based on its context in a large corpus of text. Word2vec can capture the semantic and syntactic similarity and relationships among words, such as synonyms, antonyms, analogies, or associations.
GloVe: GloVe (Global Vectors for Word Representation) is a method that combines the advantages of count-based methods (such as TF-IDF) and predictive methods (such as Word2vec) to create word vectors. GloVe can leverage both global and local information from a large corpus of text to capture the co-occurrence patterns and probabilities of words.

NEW QUESTION # 56
......

CertNexus AIP-210 Exam Syllabus Topics:

Topic	Details
Topic 1	Identify potential ethical concerns Analyze machine learning system use cases
Topic 2	Recognize relative impact of data quality and size to algorithms Engineering Features for Machine Learning
Topic 3	Train, validate, and test data subsets Training and Tuning ML Systems and Models
Topic 4	Design machine and deep learning models Explain data collection transformation process in ML workflow

Real AIP-210 Quesions Pass Certification Exams Easily: https://www.actualtests4sure.com/AIP-210-test-questions.html

AIP-210 dumps Accurate Questions and Answers with Free: https://drive.google.com/open?id=1cgT-LFqsBopioOGQsGqO9DKbhRGMJRTa

[2024] New AIP-210 exam Free Sample Questions to Practice [Q34-Q56]

CertNexus AIP-210 Exam Syllabus Topics:

Related Blogs

Latest Update

Useful Links

Contact Us