1. The values of X and Y are given in figure-1 of the image. Choose the correct value of 2X - 5Y from figure-2.
2. Which of the following types of time series analysis aims at separating periodic or cyclical components in a time series?
3. As per the Microsoft association rules model, which of the following options is the correct viewer tab that combines information about itemsets and their relative value?
4. Which of the following statements is correct about the intervention analysis type of the time series analysis?
5. Which of the following is the correct default value of the MAXIMUM_ITEMSET_SIZE parameter, which is used with the Microsoft association rules algorithm?
6. Which of the following is the correct syntax of the command that will verify the installation of the xlsx package and load the library into R workspace?
7. As per the Microsoft sequence clustering algorithm, which of the following options is the correct syntax of the Cluster (DMX) prediction function?
8. Which of the following text mining techniques can be used for finding groups of documents with similar content?
9. Which of the following is the correct default value for the INSTABILITY_SENSITIVITY parameter used with the Microsoft time series algorithm?
10. Which of the following is the correct syntax of the command used for merging two data frames, myFrame1 and myFrame2, by ID and Country?
11. From figure-2 of the given image, select the option representing the inverse of the matrix given in figure-1.
12. Which of the following is the correct syntax for the PredictAdjustedProbability (DMX) prediction function used with the Microsoft association rules algorithm?
13. In data mining, which of the following options is correct about the F-score measure for text retrieval?
14. Which of the following text retrieval measures is the percentage of documents that are relevant to the query and were actually retrieved?
15. Which of the following is the correct default value of the HOLDOUT_PERCENTAGE parameter of the Microsoft logistic regression algorithm, which is used for specifying the percentage of cases within the training data used to calculate a holdout error?
16. In advanced statistics, which of the following statements is correct about the Dirichlet Regression method?
17. Which of the matrices given in figure-2 is the reduced row echelon form of the matrix given in figure-1 of the image?
18. In which of the following text mining methods are terms analyzed on the sentence and document level?
19. As per the Microsoft association rules algorithm, which of the following parameters specifies the minimum number of cases that must contain an itemset before the algorithm generates a rule?
20. Which of the following is the correct syntax of the IsDescendant (DMX) prediction function used in data mining?
21. Consider the following parameters: control - Optional parameters for controlling boot data. frequency - Specifies the number of observations per unit time. data - Specifies the data frame. bootobject - The object returned by the boot function. conf - The desired confidence interval. type - The type of confidence interval returned. According to bootstrapping in advanced statistics, which of the following options is the correct syntax of the boot.ci() function?
22. Which of the following options is the default CLUSTERING_METHOD used by the Microsoft clustering algorithm?
23. Which of the following options is the correct return type of the PredictHistogram (DMX) prediction function used by the Microsoft logistic regression algorithm?
24. Which of the following options is the parameter of the Microsoft time series algorithm, which is used for controlling the growth of a decision tree?
25. Which of the following fundamental measures used for assessing the quality of text retrieval represent(s) the percentage of retrieved documents relevant to a query?
26. While working in a Pylab environment, which of the following options do NOT need to be imported?
27. It is given that a and b are two independent binomial variables having parameters (3, 1/4) and (2, 1/4), respectively. Find P(a + b ≥ 1).
28. The bag-of-words model is used in which of the following text mining processes?
29. For a group of 12 students, the sum of squares of differences in their ranks for science and math is given as 60. On the basis of the given information, find the value of the rank correlation coefficient.
30. Which of the following clustering algorithms is used for grid-based partitioning?
31. Which of the following statements are NOT correct about the Bayesian belief network?
32. Which of the following statements is correct about the judgement sampling method?
33. Which of the following commands is used to observe the way an R object is structured? It is given that mydata is a variable where a user's data is stored.
34. In which of the following Big Data technologies does moving relevant data management, analytics, and reporting tasks to where the data resides improve speed to insight, reduce data movement, and promote better data governance?
35. In data mining, which of the following statements is NOT correct about the C4.5 algorithm?
36. Which of the following types of association mining discovers subsequences that are common to more than the minsup sequences in a sequence database?
37. Which of the following factors is responsible for the occurrence of sampling errors?
38. In data mining, which of the following is the correct syntax for defining recall, which is used to assess the quality of text retrieval?
39. Data science is used in which of the following industries? (i) Financial services (ii) Digital advertisements (iii) Healthcare (iv) Image recognition
40. Which of the following statements is correct about the query-driven approach of data warehousing?
41. It is given that y is a Poisson variate and satisfies the condition P(y=4) = P(y=5). What are the values of mean and standard deviation of y?
42. In logistic regression, which of the given methods is used to display the conditional density plot of the binary outcome, F, on the continuous x variable?
43. Which of the following functions is used to decompose a time series with additive trend, seasonal, and irregular components?
44. In data mining, which of the following models is/are used to predict the categorical class labels?
45. In data mining, which of the following parts of a decision tree represents the outcome of a test?
46. Which of the following statements is/are correct about an SAS differentiator?
47. Which of the following is correct about classification of data?
48. The following code represents a function performed in data mining; identify the function represented. mine comparison [as <pattern_name>] for <target_class> where <target_condition> {versus <contrast_class_i> where <contrast_condition_i>} analyze <measure(s)>
49. In a generalized linear model, which of the following link functions belongs, by default, to the Poisson family?
50. In data mining, which of the following is the correct syntax of the foil method, FOIL_Prune, used for rule pruning for a rule R? It is given that p is the number of positive tuples covered by R and n is the number of negative tuples covered by R.
51. Which of the following options denotes the probability of avoiding a type-II error in hypothesis testing?
52. Which of the given options is the correct way of representing the regression equation of Y on X, given that byx is the regression coefficient of Y on X?
53. In hypothesis testing, what will you call a population whose data is categorical and belongs to a collection of discrete non-overlapping classes?
54. By default, which of the following events is/are set using the KISSmetrics analytics tool? (i) Visited site (ii) Search engine hit
55. If there is some data with missing values and you need to read a help file of a function, say median, then which of the following is the correct R syntax to do so?
56. Which of the following is the default value of the parameter HISTORIC_MODEL_GAP used in the Microsoft time series algorithm?
57. Which of the following is the DMQL syntax that is used for specifying task-relevant data?
Data Analytics MCQs | Topic-wise