Exam Details
Subject | data science | |
Paper | ||
Exam / Course | me | |
Department | ||
Organization | Gujarat Technological University | |
Position | ||
Exam Date | December, 2018 | |
City, State | gujarat, ahmedabad |
Question Paper
1
Seat No.: Enrolment
GUJARAT TECHNOLOGICAL UNIVERSITY
ME SEMESTER-1 EXAMINATION WINTER 2018
Subject Code: 3710219 Date: 07/01/2019
Subject Name: Data Science
Time: 02:30 PM To 05:00 PM Total Marks: 70
Instructions:
1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full mark.
MARKS
Q.1*
The test for rare disease is conducted,
where of the population is infected. It is a highly sensitive and specific test, which is not quite perfect:
• 99% of sick patients test positive.
• 99% of healthy patients test negative.
Given that a patient tests positive, what is the probability that the patient is actually sick?
03
Consider spam filter. Nonspam is called "ham". There are 1500 spam versus 3672 ham. The word "meeting" occurs 16 times in spam folder. There are 153 occurrences of the word "meeting" in ham. Compute the chance that an Email is spam only knowing it contains the word "meeting"?
04
Consider two sets of data
R={(restaurant,ranking)=("ABC","fivestars"), ("PQR","twostars"),("Z","zerostars"),("T","twostars").
How linear regression can be applied to find out trend and variation?
07
Q.2
For the prediction that student will be allowed to sit in the placement of an Infocom company. Identify the predictor variable, target variable, data type of variable.
03
How data science process differ in case of recommendation system predicting the weather.
04
Why anyone working with data should do exploratory data analysis? Consider 31 datasets, Each dataset represents one day's worth of advertisements and clicks recorded on the Today Time's home page. Each row represents a single user. There are five columns age, gender, image impression, number of clicks and logged in time. What kind of exploratory data analysis can be performed?
07
OR
What is central limit theorem? Give example for following with respect to central limit theorem.
Variable is normally distributed
Variable is greater than a certain number
Variable is less than a certain number
07
2
Q.3
Find the variance for following set of numbers
28,29,30,31,32
03
What are outliers? How to identify outliers in data?
04
Consider the following sentences for sentiment analysis
The weather is pleasant.
The devotional movie is excellent
The bicycle race is exciting.
What type of encoding can be used to represent sentiment data? Explain.
07
OR
Q.3
If a company wants to estimate growth in sales of a company based on current economic conditions. What kind of analysis company must do? What are the benefits?
03
What is the difference between API and library files. Explain the use of API for data collection.
04
The Music Timeline App illustrates a variety of music genres popular from 2010 to present day, based on how Music users have an artist or album in their library, and other data such as album release dates. Which data visualization techniques can be used to represent what kind of data? keep Music App in mind.
07
Q.4
What are the challenges for data storage and management?
03
The analysis of age is to be performed for customer visiting the mall. Which visualization techniques can be used?
04
Which methods can be used to fill the missing data? Explain the case of numerical and categorical data.
07
OR
Q.4
Which types of data are used in data science?
03
In the analysis of product category, how SVM can be applied?
04
What are retinal variables? How encoding of retinal variables is done?
07
Q.5
Which type of statistics can overcome the issue of outliers?
03
How data form multiple sources can be handled?
04
The task is to automate the assignment of new products to company's product categories, For example stereo is to be categorized as electronic system. This is which type of problem and what kind of learning can be applied? Which method best suits for this Justify.
07
OR
Q.5
"Significant skewness indicate that the mean and standard deviation are not good measures of distribution". True or False? Justify.
03
How distribution of categorical data can be calculated?
04
Explain the process of credit card transaction. How can it be verified that the transaction is fraudulent or not?
07
Seat No.: Enrolment
GUJARAT TECHNOLOGICAL UNIVERSITY
ME SEMESTER-1 EXAMINATION WINTER 2018
Subject Code: 3710219 Date: 07/01/2019
Subject Name: Data Science
Time: 02:30 PM To 05:00 PM Total Marks: 70
Instructions:
1. Attempt all questions.
2. Make suitable assumptions wherever necessary.
3. Figures to the right indicate full mark.
MARKS
Q.1*
The test for rare disease is conducted,
where of the population is infected. It is a highly sensitive and specific test, which is not quite perfect:
• 99% of sick patients test positive.
• 99% of healthy patients test negative.
Given that a patient tests positive, what is the probability that the patient is actually sick?
03
Consider spam filter. Nonspam is called "ham". There are 1500 spam versus 3672 ham. The word "meeting" occurs 16 times in spam folder. There are 153 occurrences of the word "meeting" in ham. Compute the chance that an Email is spam only knowing it contains the word "meeting"?
04
Consider two sets of data
R={(restaurant,ranking)=("ABC","fivestars"), ("PQR","twostars"),("Z","zerostars"),("T","twostars").
How linear regression can be applied to find out trend and variation?
07
Q.2
For the prediction that student will be allowed to sit in the placement of an Infocom company. Identify the predictor variable, target variable, data type of variable.
03
How data science process differ in case of recommendation system predicting the weather.
04
Why anyone working with data should do exploratory data analysis? Consider 31 datasets, Each dataset represents one day's worth of advertisements and clicks recorded on the Today Time's home page. Each row represents a single user. There are five columns age, gender, image impression, number of clicks and logged in time. What kind of exploratory data analysis can be performed?
07
OR
What is central limit theorem? Give example for following with respect to central limit theorem.
Variable is normally distributed
Variable is greater than a certain number
Variable is less than a certain number
07
2
Q.3
Find the variance for following set of numbers
28,29,30,31,32
03
What are outliers? How to identify outliers in data?
04
Consider the following sentences for sentiment analysis
The weather is pleasant.
The devotional movie is excellent
The bicycle race is exciting.
What type of encoding can be used to represent sentiment data? Explain.
07
OR
Q.3
If a company wants to estimate growth in sales of a company based on current economic conditions. What kind of analysis company must do? What are the benefits?
03
What is the difference between API and library files. Explain the use of API for data collection.
04
The Music Timeline App illustrates a variety of music genres popular from 2010 to present day, based on how Music users have an artist or album in their library, and other data such as album release dates. Which data visualization techniques can be used to represent what kind of data? keep Music App in mind.
07
Q.4
What are the challenges for data storage and management?
03
The analysis of age is to be performed for customer visiting the mall. Which visualization techniques can be used?
04
Which methods can be used to fill the missing data? Explain the case of numerical and categorical data.
07
OR
Q.4
Which types of data are used in data science?
03
In the analysis of product category, how SVM can be applied?
04
What are retinal variables? How encoding of retinal variables is done?
07
Q.5
Which type of statistics can overcome the issue of outliers?
03
How data form multiple sources can be handled?
04
The task is to automate the assignment of new products to company's product categories, For example stereo is to be categorized as electronic system. This is which type of problem and what kind of learning can be applied? Which method best suits for this Justify.
07
OR
Q.5
"Significant skewness indicate that the mean and standard deviation are not good measures of distribution". True or False? Justify.
03
How distribution of categorical data can be calculated?
04
Explain the process of credit card transaction. How can it be verified that the transaction is fraudulent or not?
07
Other Question Papers
Subjects
- 3g & 4g mobile communication
- ad hoc and wireless sensor network
- adaptive signal processing
- additives & compounding
- adv. chem. engg. thermodynamics
- advance air conditioning technology
- advance biomedical imaging
- advance casting technology
- advance control systems
- advance cryptography and information security
- advance database
- advance electrical machines
- advance heat transfer
- advance image processing
- advance industrial drives and control
- advance material technology
- advance oil hydraulic and pneumatic systems
- advance operating system
- advance operation research
- advance production & operation management
- advance signal processing & estimation
- advance stress analysis
- advance topics in textile manufacturing
- advance transport phenomena (atp)
- advance vlsi design
- advanced analytical techniques
- advanced civil engineering materials
- advanced communication networks
- advanced computer architecture
- advanced concrete design
- advanced concrete structures
- advanced construction techniques
- advanced control techniques for electrical machines
- advanced data structures
- advanced design of concrete structures
- advanced design of steel structures
- advanced device drivers - ii
- advanced digital circuit design
- advanced digital communication
- advanced digital signal processing
- advanced digital signal processing and applications
- advanced engineering dynamics
- advanced engineering materials
- advanced fabric manufacturing
- advanced fluid mechanics
- advanced foundation engineering
- advanced geotechnical engineering
- advanced image processing
- advanced internal combustion engine
- advanced kinetics and reaction engineering
- advanced machine design
- advanced mass transfer
- advanced materials processing techniques
- advanced mechanism design
- advanced metrology & experimental techniques
- advanced microcontroller and logic controllers
- advanced power converters
- advanced power electronics
- advanced power electronics devices
- advanced power system protection & switchgear
- advanced power system protection and switchgear
- advanced process optimization
- advanced process synthesis
- advanced reaction engineering
- advanced refrigeration
- advanced refrigeration engineering
- advanced seismic design of structures
- advanced separation processes
- advanced soil mechanics
- advanced solid mechanics
- advanced steel structures
- advanced thermodynamics
- advanced thermodynamics & heat transfer
- advanced thermodynamics and heat transfer
- advanced topics in textile manufacture
- advanced transport processes
- advanced welding technology
- advanced wireless and mobile networks
- advances in concrete technology and sustainable construction practices
- advances in transportation engineering
- advances in wireless communication
- ai techniques
- air & noise pollution control
- air pollution control equipment
- airport system planning and design
- algorithms for vlsi physical design automation
- alternate fuels and energy
- alternative fuels for transportation
- anaerobic biotechnologies
- analog cmos circuit design
- analysis & design of foundation systems
- analytical and numerical methods for structural engg.
- antenna engg. design
- antennas and radiating systems
- application based system for air pollution control management
- application based systems for transport of water & wastewater
- application of nanotechnology in chemical engineering
- application of power electronics in renewable energy conversion
- application of power electronics to power system
- application security
- applied biomechanics
- applied linear algebra
- applied super conductivity
- arm processor architecture and system design
- artificial intelligence
- artificial intelligence and expert systems
- artificial intelligence for information technology
- artificial intelligent application to power system
- asic design
- audio video coding & compression
- automative chassis and body engineering
- automobile maintenance & pollution control
- automobile refrigeration & a/c
- automotive aerodynamics & safety
- basics of transportation engineering
- big data analytics
- biodynamics
- bioelectricity
- biological control system and modelling
- biomass energy conversion
- biomedical image processing
- biomedical signal processing
- bioprocess &biochemical engineering
- biosensors & biomems
- biostatistics
- cad/cam systems
- cfd applications in chemical engineering
- chemical process optimization
- chemical reactor analysis
- chemical system modeling and simulation
- cleaner production in chemical industries (cpci)
- cleaner production in rubber industries
- climate change
- cloud and grid computing
- cloud computing
- cloud security
- cmos circuit design - i
- cmos circuit design - ii
- cognitive radio
- collection and conveyance of water and wastewater
- combustion engineering
- composites material technology
- computational method
- computer aided design
- computer aided machine design
- computer aided manufacturing
- computer aided process planning
- computer aided production management
- computer algorithm
- computer methods in power system analysis
- computer networks
- computer vision
- computerized process control
- concepts in mechatronics engineering
- construction contract management
- construction project management
- construction techniques
- control system theory
- cortex-m4 processor architecture and programming
- cryogenic engineering
- cryogenic fundamentals
- cryogenic plant and equipment
- cryogenic system
- cyber crime, ethics and laws
- cyber forensics
- data center managment
- data communication and networking
- data mining and data warehousing
- data science
- data structure with object oriented programming
- database management system
- database management systems
- date:25/05 /2017
- decision models in management
- design and analysis of experiments
- design and optimization of thermal system
- design for manufacturing and assembly
- design of bridges
- design of experiment
- design of experiment & statistical techniques
- design of heat exchange equipments
- design of heat exchangers
- design of hydraulic structures
- design of language processors
- design of material handling equipments
- design of tall structures
- device drivers - i
- digital control
- digital forensic
- digital image and video processing
- digital image processing
- digital image processing and applications
- digital modulation and coding
- digital protection
- digital signal processing
- digital signal processing algorithms
- digital signal processors: architecture & programming
- digital video processing
- digital vlsi design ii backend (elective i)
- disaster management
- disaster management and mitigation
- discrete time signal processing
- distributed computing and applications
- distributed database application system
- distributed operating system
- docks and harbour engineering
- earth and rockfill dams
- economic evaluation of transportation projects
- economics of energy generation & supply
- electric power distribution system
- electric vehicles
- electrical energy conservation & management
- electrical machine modelling and analysis
- electromagnetic compatibility in power electronics
- elementary machine foundation
- embedded and linux programming
- embedded and vlsi signal processing
- embedded system for instrumentation
- embedded systems
- embedded systems for biomedical applications
- embedded wireless technologies
- emc in power electronics
- energy and mass integration
- energy audit and management
- energy conservation & management
- energy conversion systems
- energy economics and management
- energy efficient electrical systems
- energy management
- energy resources economics and environment
- energy technology
- engineering economics & financial management
- engineering optimization
- english for research paper writing
- environment impact assessment of transportation project
- environmental chemistry & microbiology
- environmental geotechnology
- environmental impact assessment
- environmental legislation
- environmental legislations & management
- environmental modeling
- environmental monitoring
- ethical hacking
- ethical hacking & cyber law
- exergy analysis of thermal systems
- experimental techniques and instrumentations in automobile engineering
- experimental techniques and instrumentations in thermal engineering
- facility planning and design
- facts
- fiber optic communication
- finite element method
- finite element method in structural engineering
- finite element methods
- finite element methods in geotechnical engineering
- first course in optimization techniques
- flexible ac transmission system
- flexible manufacturing system
- flood management
- fluid mechanics and gas dynamics
- fluidization engineering
- fluvial hydraulics
- fundamentals of ic engines and automobiles
- fundamentals of micro mechatronics systems
- geo informatics in construction management
- geo spatial techniques
- geospatial techniques and planning
- geosynthetics and reinforced earth
- ground improvement techniques
- groundwater management
- harmonic measurements and filtration techniques
- hdl based design with programmable logic
- high speed cmos vlsi circuit
- high speed diesel engine
- higher engineering mathematics
- highway materials and construction
- hospital administration & management
- hydraulic & pneumatic systems in automotive vehicles
- hydro system engineering
- hydrogen & fuel cell technology
- hydrology & watershed management
- hydropower engineering
- image processing
- image processing for instrumentation
- indusrial hygine & safety
- industrial biotechnology
- industrial data networks
- industrial drives
- industrial electronics & control
- industrial hygiene & safety
- industrial pollution control
- industrial water & wastewater treatment
- information security
- information system and network security
- information theory & coding
- information theory and coding
- infrastructure & transportation planning
- infrastructure and transportation planning
- infrastructure projects
- intelligent sensor and instrumentation
- intelligent systems and control
- internet technology
- internetworking & application
- introduction to artificial intelligence
- introduction to biomedical engineering
- introduction to cryptography
- introduction to optimization techniques
- it infrastructure management
- it service management
- it systems and management
- lean manufacturing system and implementation
- legal issues in urban planning
- logistic and supply chain management
- logistics and supply chain management
- low temperature measurement and instrumentation
- machine tool design
- machining science
- mathematical and statistical methods in chemical engineering
- mathematical foundation for cyber security
- matrix analysis of framed structures
- matrix methods of structural analysis
- mechanics and manufacturing of compositesautomotive aerodynamics & safety
- mechanics of metal forming
- mechatronics
- mechatronics signal processing
- medical ethics and standards
- medical instrumentation & systems
- metrology & computer aided inspection
- metrology and computer aided inspection
- micro and nano manufacturing system
- microcontrollers and programmable digital signal processors
- microwave integrated circuits
- mixed signal controllers
- mixing of rubbers (mr)
- modeling and analysis of electric machines
- modelling & simulation of rubber processing (msrp)
- modelling and analysis of electrical machines
- modern control systems
- multibody dynamics
- mutli gate transistors
- network defence
- network programming
- neuro computing and applications
- numerical method
- numerical method for computer engineering
- numerical methods and statistical analysis
- numerical methods and statistical analysis for chemical engineering
- numerical methods for civil engineering
- object oriented methodology & design
- object oriented programming and with data structure
- off-shore structures
- oil hydraulics and pneumatics
- oop with java
- operation planning & control techniques
- operations planning and control techniques
- optical networks
- optimization in rubber industries
- optimization techniques for engineers
- optimization theory and practice
- pattern recognization
- pavement design, construction and evaluation
- peripheral system design and interfacing
- petroleum refinery engineering
- physics of mos transistor
- physics of rubber elasticity
- pki and biometrics
- planning history and theory
- planning, scheduling & control of construction projects
- plastic processing technology
- plastics materials
- plastics mould & product design simulations
- plastics packaging technology
- plastics processing technology
- plastics testing technology
- plates and shells
- politics & public policy planning
- politics and public policy planning
- polymer alloys and blends
- polymer blends and alloys
- polymer science and technology
- powder & particulate rubber technology
- powder and particulate technology
- power conditioning
- power converters-i
- power efficient vlsi design
- power electronics
- power electronics – i
- power electronics – ii
- power electronics converters and applications
- power electronics for power system
- power processing circuits
- power quality
- power quality issues and their mitigation techniques in power system
- power system dynamics & control
- power system dynamics and control
- power system modeling and simulation
- power system restructuring
- power system transients
- pressure vessel and piping system design
- prestressed concrete
- probability and random process
- process & quality control in textile
- process auxiliaries and utilities
- process control and optimization
- process intensification & integration (pii)
- process modelling & simulation
- process safety management
- product automation and cnc technology
- product design
- product design for manufacturing
- product development and innovation (major elective-ii)
- production & operation management
- production management systems
- programmable logic controller
- property prediction for mixtures
- public transportation planning
- pwm converter and applications
- quality control and reliability
- quality control and safety management in construction
- quality engineering & six sigma fundamentals
- radar signal processing
- rail transportation system planning & design
- rapid prototyping and tooling
- rapid prototyping, tooling and synergic integration
- real time operating system
- real time operating systems
- regional and mass transportation system planning
- regional planning
- rehabilitation and retrofitting of buildings
- rehabilitation and retrofitting of structures
- remote sensing and its application
- renewable energy engineering
- research methodology
- resources management
- rf and microwave
- rf integrated circuits
- road safety audit
- robotic engineering
- robotic engineering (mechatronics)
- robotics & control
- robotics and artificial intelligence
- robotics and intelligent systems
- robotics engineering
- robust design
- rtl simulation and synthesis with plds
- rubber blends
- rubber bonding & its technology
- rubber cultivation & rubber lattices
- security standards and audit (elective-i)
- semantic web
- sensor signal processing
- sensor technology
- service oriented architecture
- sheet metal process
- signal analysis and transform
- silicon on insulator
- simulation modeling of manufacturing system
- smart antennas for wireless communication
- smart grid technology and applications
- smart sensors and internet of things
- soft computing
- software engineering methodology
- software project management
- soil improvement technology
- soil structure interaction
- solar energy engineering
- solar refrigeration and air conditioning
- solar refrigeration and air-conditioning
- solid & hazardous waste management
- solid state ac drives
- solid state dc drives
- speciality elastomers and its technology
- speech signal processing
- statistical information processing
- statistical signal analysis
- statistical techniques and design of experiment
- statistics for biomedical engineers
- statistics for engineers
- strategic management
- structural dynamics
- structural dynamics and earthquake engineering
- structural optimization
- subject name:
- subsurface investigation & instrumentations
- surface science and nano technology
- sustainable construction practices
- system design
- telecom switching system ,networks and network management
- telecom switching system, networks and network management
- testing and verification of vlsi design
- textured yarn technology
- theory & design of textile machine - i
- theory and applications of cement composites
- theory and design of textile machine i
- theory of elasticity
- theory of elasticity & plasticity
- theory of fabric structures
- theory of thin plates & shells
- theory of yarn manufacture
- theory of yarn structure
- thermal and nuclear power plants
- thermoplastics elastomers
- thermosetting resins & silane technology(trst)
- tool & die design
- total quality management
- traffic engineering
- traffic flow theory and simulation
- transportation facility design
- transportation system management
- treatment process design and drawing
- tribology
- urban governance & development management
- urban housing
- urban planning techniques & practice
- urban transportation systems planning
- vacuum engineering
- value engineering
- verification methodology
- vibration and noise
- video processing
- virtual biomedical instrumentation
- vlsi signal processing
- water and wastewater technologies
- water resource planning
- water supply and drainage
- water use management
- wavelet transform and applications
- wavelet transforms and applications
- web and database security
- wind and small hydro energy system
- wireless & mobile communication
- wireless adhoc network
- wireless and mobile network architectures
- wireless communication
- wireless networking & mobile computing
- wireless sensor network for it
- wireless sensor networks & its energy management
- wireless signal propagation and fading
- work system design and human factors engineering