Exam Details
Subject | foundations of data science | |
Paper | ||
Exam / Course | m.tech | |
Department | ||
Organization | Institute Of Aeronautical Engineering | |
Position | ||
Exam Date | January, 2019 | |
City, State | telangana, hyderabad |
Question Paper
Hall Ticket No Question Paper Code: BCS001
INSTITUTE OF AERONAUTICAL ENGINEERING
(Autonomous)
M.Tech I Semester End Examinations (Supplementary) January, 2019
Regulation: IARE-R16
FOUNDATIONS OF DATA SCIENCE
Time: 3 Hours Max Marks: 70
Answer ONE Question from each Unit
All Questions Carry Equal Marks
All parts of the question must be answered in one place only
UNIT I
1. Draw the life cycle of data science project and summarize the stages of data science project.
Describe how to explore data in R. Explain graphics and visualization in spotting problems.
2. List the different R functions to read and write the data from disk and R object. Write R script
to choose the character data dynamically from user.
Explain and write the appropriate statements in R for the following operations.
• Input as CSV File(input.csv)
• Reading a CSV file
• Apply different analysis on the file
• Writing into a CSV File
UNIT II
3. Differentiate SQL and No SQL databases in detail. Give example of XML data extraction and
operations using R.
What is JSON file? Explain the input, output operations of JSON file using R.
4. Compute covariance matrix and correlation matrix for the four numerical attributes. Interpret
the statistical findings to know more about hidden nature in data.
Summarize multiple regression in R and create equation for regression model.
Page 1 of 2
UNIT III
5. How to predict whether an email is a spam and should be delivered to the junk folder. Suggest
suitable data model.
Calculate the Jaccard coefficient for the given data
p 1 0 0 0 0 0 0 0 0 0
q 0 0 0 0 0 0 1 0 0 1.
6. Outline about the learning of a model? Write any four learning techniques and in each case give
the expression for weight- updating.
Write short notes on Hierarchical clustering with hclust().
UNIT IV
7. Give the basic structure of neural network and different artificial neural network with real time
examples.
Discuss the difference of error in two hypotheses. Differentiate the MAP (maximum a posteriori)
and ML (maximum likelihood) hypothesis. Give an example of a scenario in which a MAP
hypothesis is preferable to an ML hypothesis.
8. Describe the prediction model in terms of the following measures for best fit: Residual standard
error, Multiple R-squared, F-statistic, p-value
Compare the learning algorithms with example in terms of problem nature, accuracy and error
rate.
UNIT V
9. Write R script to plot a data frame having: {df1:{red,green,blue,pink,black} df2:
using relevant plot.
Generalize the graphical analysis in data analysis. List the various plots in R and explain in
detail.
10. How would you get the multiple plots in single window? Plot the regression model along with
residuals.Write a R script for creating a boxplot of iris sepal length attribute.
Elaborate how to export a graph using graphics parameters.How to export the text data to plot
with example.
Page 2 of 2
INSTITUTE OF AERONAUTICAL ENGINEERING
(Autonomous)
M.Tech I Semester End Examinations (Supplementary) January, 2019
Regulation: IARE-R16
FOUNDATIONS OF DATA SCIENCE
Time: 3 Hours Max Marks: 70
Answer ONE Question from each Unit
All Questions Carry Equal Marks
All parts of the question must be answered in one place only
UNIT I
1. Draw the life cycle of data science project and summarize the stages of data science project.
Describe how to explore data in R. Explain graphics and visualization in spotting problems.
2. List the different R functions to read and write the data from disk and R object. Write R script
to choose the character data dynamically from user.
Explain and write the appropriate statements in R for the following operations.
• Input as CSV File(input.csv)
• Reading a CSV file
• Apply different analysis on the file
• Writing into a CSV File
UNIT II
3. Differentiate SQL and No SQL databases in detail. Give example of XML data extraction and
operations using R.
What is JSON file? Explain the input, output operations of JSON file using R.
4. Compute covariance matrix and correlation matrix for the four numerical attributes. Interpret
the statistical findings to know more about hidden nature in data.
Summarize multiple regression in R and create equation for regression model.
Page 1 of 2
UNIT III
5. How to predict whether an email is a spam and should be delivered to the junk folder. Suggest
suitable data model.
Calculate the Jaccard coefficient for the given data
p 1 0 0 0 0 0 0 0 0 0
q 0 0 0 0 0 0 1 0 0 1.
6. Outline about the learning of a model? Write any four learning techniques and in each case give
the expression for weight- updating.
Write short notes on Hierarchical clustering with hclust().
UNIT IV
7. Give the basic structure of neural network and different artificial neural network with real time
examples.
Discuss the difference of error in two hypotheses. Differentiate the MAP (maximum a posteriori)
and ML (maximum likelihood) hypothesis. Give an example of a scenario in which a MAP
hypothesis is preferable to an ML hypothesis.
8. Describe the prediction model in terms of the following measures for best fit: Residual standard
error, Multiple R-squared, F-statistic, p-value
Compare the learning algorithms with example in terms of problem nature, accuracy and error
rate.
UNIT V
9. Write R script to plot a data frame having: {df1:{red,green,blue,pink,black} df2:
using relevant plot.
Generalize the graphical analysis in data analysis. List the various plots in R and explain in
detail.
10. How would you get the multiple plots in single window? Plot the regression model along with
residuals.Write a R script for creating a boxplot of iris sepal length attribute.
Elaborate how to export a graph using graphics parameters.How to export the text data to plot
with example.
Page 2 of 2
Other Question Papers
Subjects
- ac to dc converters
- advanced cad
- advanced concrete technology
- advanced data structures
- advanced database management system
- advanced mechanics of solids
- advanced reinforced concrete design
- advanced solid mechanics
- advanced steel design
- advanced structural analysis
- advanced web technologies
- big data analytics
- computer aided manufacturing
- computer aided process planning
- computer architecture
- computer oriented numerical methods
- cyber security
- data science
- data structures and problem solving
- dc to ac converters
- design for manufacturing and assembly
- design for manufacturing mems and micro systems
- design of hydraulic and pneumatic system
- distributed operated system
- earthquake resistant design of buildings
- embedded c
- embedded networking
- embedded real time operating systems
- embedded system architecture
- embedded system design
- embedded wireless sensor networks
- english for research paper writing
- finite element method
- flexible ac transmission systems
- flexible manufacturing system
- foundations of data science
- foundations of data sciences
- fpga architecture and applications
- hardware and software co-design
- high performance architecture
- intelligent controllers
- internet of things
- introduction to aerospace engineering
- mathematical foundation of computer
- mathematical methods in engineering
- matrix methods of structural analysis
- micro controllers and programmable digital signal processing
- multilevel inverters
- numerical method for partial differential equations
- power electronic control of ac drives
- power electronic control of dc drives
- power quality
- precision engineering
- principles of distributed embedded systems
- programmable logic controllers and their applications
- rapid prototype technologies
- rehabilitation and retrofitting of structures
- renewable energy systems
- research methodology
- soft computing
- special machines and their controllers
- stress analysis and vibration
- structural dynamics
- structural health monitoring
- theory of elasticity and plasticity
- theory of thin plates and shells
- web intelligent and algorithm
- wireless lan’s and pan’s
- wireless lans and pans
- wireless sensor networks