Exam Details
Subject | big data analytics | |
Paper | ||
Exam / Course | m.tech | |
Department | ||
Organization | Institute Of Aeronautical Engineering | |
Position | ||
Exam Date | January, 2018 | |
City, State | telangana, hyderabad |
Question Paper
Hall Ticket No Question Paper Code: BCS212
INSTITUTE OF AERONAUTICAL ENGINEERING
(Autonomous)
M.Tech I Semester End Examinations (Regular) January, 2018
Regulation: IARE-R16
BIG DATA ANALYTICS
(Computer Science and Engineering
Time: 3 Hours Max Marks: 70
Answer ONE Question from each Unit
All Questions Carry Equal Marks
All parts of the question must be answered in one place only
UNIT I
1. What are the elements of Big Data? Justify how it is both distributed and parallel computed?
Explain with an example of exporting all data onto cloud (AWS/Rackspace).
List out various data generators and describe how to manage data its architecture for analysis.
Assuming a prior probability of 0.3 that a given marketing communication will result in a website
visit, how many marketing emails were likely opened if you received 60 website visits?
2. Define Big Data analytics and state what made it powerful? What are the benefits of the Master
Data Management solution?
List out the steps that are to be taken reasonably in modelling when a model has to be built for
a dataset of 200 patients with 4,000 variables including an indicator of whether or not they had
developed cancer in the past year.
UNIT II
3. Write tools and list the responsibilities of people who bring needed analytical skills to a project
for maximizing the benefits of Big Data analytics for organizations.
State the reason for the problem and procedure to overcome, when an attempt is done to model
the price of the cars at auction and you find that your model has trained well, but subsequently
does poorly on new data.
4. Determine the advantages of Big Data analytics strategy which are often defined by the three
volume, variety and velocity with examples.
Compare reporting and analysis in the context of Big Data analytics which would help to improve
business.
UNIT III
5. Explain the installation process of HDFS and explain one of the best ways to assess a model
usefulness which is built when marketing department of a company is looking for a way to call
customers who are likely to churn and persuade them to stop.
Justify how Map Reduce can best be described as a programming model used to develop Hadoop
based applications that can process massive amounts of unstructured data with an example.
Page 1 of 2
6. Distinguish between Name node and Data node. Describe with suitable example.
What are the techniques used to optimize map reduce job? Explain about the partitioning,
shuffle and sort phase.
UNIT IV
7. Explain about the Hadoop input and output with an example and write a note on data integrity?
Sketch daemons architecture in Hadoop distributed file system? What are the applications of
Hadoop and Hadoop YARN?
8. Discuss Hadoop streaming with Ruby and Python programming language with at least one
example.
Which of the components retrieve the input splits directly from HDFS to determine the number
of map tasks.
UNIT V
9. In social network analysis, how do you measure the centrality of a node based on the number of
links each node has? Explain with an example. Describe the key elements of social media.
Compare on different analytics like sentiment analytics, social media analytics and web analytics.
10. Write the steps and procedure to download tweets of interest using Python or R.
List out the reasons why mobile analytics is important? List the different performance metrics
to measure the success of mobile application.
Page 2 of 2
INSTITUTE OF AERONAUTICAL ENGINEERING
(Autonomous)
M.Tech I Semester End Examinations (Regular) January, 2018
Regulation: IARE-R16
BIG DATA ANALYTICS
(Computer Science and Engineering
Time: 3 Hours Max Marks: 70
Answer ONE Question from each Unit
All Questions Carry Equal Marks
All parts of the question must be answered in one place only
UNIT I
1. What are the elements of Big Data? Justify how it is both distributed and parallel computed?
Explain with an example of exporting all data onto cloud (AWS/Rackspace).
List out various data generators and describe how to manage data its architecture for analysis.
Assuming a prior probability of 0.3 that a given marketing communication will result in a website
visit, how many marketing emails were likely opened if you received 60 website visits?
2. Define Big Data analytics and state what made it powerful? What are the benefits of the Master
Data Management solution?
List out the steps that are to be taken reasonably in modelling when a model has to be built for
a dataset of 200 patients with 4,000 variables including an indicator of whether or not they had
developed cancer in the past year.
UNIT II
3. Write tools and list the responsibilities of people who bring needed analytical skills to a project
for maximizing the benefits of Big Data analytics for organizations.
State the reason for the problem and procedure to overcome, when an attempt is done to model
the price of the cars at auction and you find that your model has trained well, but subsequently
does poorly on new data.
4. Determine the advantages of Big Data analytics strategy which are often defined by the three
volume, variety and velocity with examples.
Compare reporting and analysis in the context of Big Data analytics which would help to improve
business.
UNIT III
5. Explain the installation process of HDFS and explain one of the best ways to assess a model
usefulness which is built when marketing department of a company is looking for a way to call
customers who are likely to churn and persuade them to stop.
Justify how Map Reduce can best be described as a programming model used to develop Hadoop
based applications that can process massive amounts of unstructured data with an example.
Page 1 of 2
6. Distinguish between Name node and Data node. Describe with suitable example.
What are the techniques used to optimize map reduce job? Explain about the partitioning,
shuffle and sort phase.
UNIT IV
7. Explain about the Hadoop input and output with an example and write a note on data integrity?
Sketch daemons architecture in Hadoop distributed file system? What are the applications of
Hadoop and Hadoop YARN?
8. Discuss Hadoop streaming with Ruby and Python programming language with at least one
example.
Which of the components retrieve the input splits directly from HDFS to determine the number
of map tasks.
UNIT V
9. In social network analysis, how do you measure the centrality of a node based on the number of
links each node has? Explain with an example. Describe the key elements of social media.
Compare on different analytics like sentiment analytics, social media analytics and web analytics.
10. Write the steps and procedure to download tweets of interest using Python or R.
List out the reasons why mobile analytics is important? List the different performance metrics
to measure the success of mobile application.
Page 2 of 2
Other Question Papers
Subjects
- ac to dc converters
- advanced cad
- advanced concrete technology
- advanced data structures
- advanced database management system
- advanced mechanics of solids
- advanced reinforced concrete design
- advanced solid mechanics
- advanced steel design
- advanced structural analysis
- advanced web technologies
- big data analytics
- computer aided manufacturing
- computer aided process planning
- computer architecture
- computer oriented numerical methods
- cyber security
- data science
- data structures and problem solving
- dc to ac converters
- design for manufacturing and assembly
- design for manufacturing mems and micro systems
- design of hydraulic and pneumatic system
- distributed operated system
- earthquake resistant design of buildings
- embedded c
- embedded networking
- embedded real time operating systems
- embedded system architecture
- embedded system design
- embedded wireless sensor networks
- english for research paper writing
- finite element method
- flexible ac transmission systems
- flexible manufacturing system
- foundations of data science
- foundations of data sciences
- fpga architecture and applications
- hardware and software co-design
- high performance architecture
- intelligent controllers
- internet of things
- introduction to aerospace engineering
- mathematical foundation of computer
- mathematical methods in engineering
- matrix methods of structural analysis
- micro controllers and programmable digital signal processing
- multilevel inverters
- numerical method for partial differential equations
- power electronic control of ac drives
- power electronic control of dc drives
- power quality
- precision engineering
- principles of distributed embedded systems
- programmable logic controllers and their applications
- rapid prototype technologies
- rehabilitation and retrofitting of structures
- renewable energy systems
- research methodology
- soft computing
- special machines and their controllers
- stress analysis and vibration
- structural dynamics
- structural health monitoring
- theory of elasticity and plasticity
- theory of thin plates and shells
- web intelligent and algorithm
- wireless lan’s and pan’s
- wireless lans and pans
- wireless sensor networks