Handsome Savings - Limited Time Offer 30% OFF - Ends In 0d 0h 0m 0s Coupon code: 50OFF
Welcome to QA4Exam
Logo

- Trusted Worldwide Questions & Answers

Databricks Databricks-Certified-Professional-Data-Scientist Exam Actual Questions

The questions for Databricks-Certified-Professional-Data-Scientist were last updated on Oct 1, 2024.
  • Viewing page 1 out of 28 pages.
  • Viewing questions 1-5 out of 138 questions
Unlock Access to All 138 Questions & Answers
Question No. 1

You are using k-means clustering to classify heart patients for a hospital. You have chosen Patient Sex, Height, Weight, Age and Income as measures and have used 3 clusters. When you create a pair-wise plot of the clusters, you notice that there is significant overlap between the clusters. What should you do?

Show Answer Hide Answer
Correct Answer: C

Question No. 2

You are asked to create a model to predict the total number of monthly subscribers for a specific magazine. You are provided with 1 year's worth of subscription and payment data, user demographic data, and 10 years worth of content of the magazine (articles and pictures). Which algorithm is the most appropriate for building a predictive model for subscribers?

Show Answer Hide Answer
Correct Answer: A

Before you model the relationship between pairs of quantities, it is a good idea to perform correlation analysis to establish if a linear relationship exists between these quantities. Be aware that variables can have nonlinear relationships, which correlation analysis cannot detect. For more information, see Linear Correlation.

If you need to fit data with a nonlinear model, transform the variables to make the relationship linear. Alternatively try to fit a nonlinear function directly using either the Statistics and Machine Learning Toolbox nlinfit function, the Optimization Toolbox Isqcurvefit function, or by applying functions in the Curve Fitting Toolbox.

79


Question No. 3

Refer to the exhibit.

You are building a decision tree. In this exhibit, four variables are listed with their respective values of info-gain.

Based on this information, on which attribute would you expect the next split to be in the decision tree?

Show Answer Hide Answer
Correct Answer: A

Question No. 4

You have collected the 100's of parameters about the 1000's of websites e.g. daily hits, average time on the websites, number of unique visitors, number of returning visitors etc. Now you have find the most important parameters which can best describe a website, so which of the following technique you will use

Show Answer Hide Answer
Correct Answer: A

Question No. 5

Which of the below best describe the Principal component analysis

Show Answer Hide Answer
Correct Answer: A

Product Image

Unlock All Questions for Databricks Databricks-Certified-Professional-Data-Scientist Exam

Full Exam Access, Actual Exam Questions, Validated Answers, Anytime Anywhere, No Download Limits, No Practice Limits

Get All 138 Questions & Answers