Gitika Aggarwal
ADDRESS : Flat# 102, Divya Comforts,
Krishnappa Gardens, Near Sagar Apollo Hospital,
Bangalore-560041, Karnataka, INDIA.
TELEPHONE : +91 9880 606 332
EMAIL : gitika [at] students [dot] iiit [dot] net
It is my ambition to be part of the research community of a good school and be involved
in exciting and novel research in the areas of Distributed Systems and Data Management.
Research Interests
Distributed Systems
Data Management
Machine Learning
B.Tech. Computer Science and Engineering (Honors in Data Engineering)
International Institute of Information Technology, Hyderabad (2004), CGPA: 9.05/10.0
Senior Secondary
Gautami Academy Junior College (2000), Aggr: 91.0%
St. Ann’s High School (1998), Aggr: 86.0%
Awarded for scoring the Highest Overall Percentage – Secondary school, 1998.
Recipient of the National Merit Scholarship for the Secondary State Examinations
– awarded by the National Government to the top 10% scorers in every county.
Secured 78th position among over 32,000 students in the All India Entrance
Examination for IIIT - Hyderabad, 2000.
Recipient of the Dean’s List award (awarded to the top 10% of the class) during my
undergraduate study, for the following semesters – Spring 2001, Spring 2002, Fall
2003, Spring 2003, Spring 2004.
BUDDIES: Bus Driven Data Dissemination System
Kamalakar Karlapalem, Gitika Aggarwal, Krishnaveni Budati
Proceedings of the 10th International Conference on High Performance
Computing (HiPC), 2003, Hyderabad, INDIA.
(Received Best Poster Presentation Award)
Internships and Teaching Assistantships
Summer Internship at CMC Limited, Hyderabad, India.
I was one of 11 students, out of the 138 in my class, chosen for the internship.
Teaching Assistant for the course 'Data Structures and Algorithms', Spring 2004.
Instructor: Dr. Madhu Mutyam (madhu_mutyam [at] iiit [dot] net)
Experience in the Industry
Position Held: Support Engineer
Nature of Work: Database Administrator
Name of Organization: Oracle Corporation (India), 06/2004 - To date
Skill Set
Programming Languages and Programming Tools: C, C++, Java, Perl; OpenGL, VRML, Matlab, Qt
Operating Systems: Windows 98/ME/2000/XP, Linux
Honors Stream Courses and Content
Introduction to Database Management Systems (Spring 2002)
Entity-relationship model, Relational Model, Relational Algebra
and Calculus, Database Design and Normalization, Indexing
Schemes, Integrity, Concurrency Control, Transaction
Management, Recovery, Query Processing, SQL.
Distributed Database Management Systems (Fall 2002)
Architectures, Distributed Database Design, Distribution
Transparency, Data Replication, Query Decomposition and
Optimization, Distributed Transaction Management, Distributed
Concurrency Control, Reliability.
Data Warehousing and Data Mining (Spring 2003)
RDBMS vs Data Warehouses, Data Warehouse schemas,
Applications of Data Warehouses, Sources of data, Problems with
data, Data Cleaning, Data pre-processing, Clustering,
Classification, Decision trees, Association Rule Mining.
Select Elective Courses
Computer Graphics, Pattern Recognition, Performance Evaluation of Computer Systems,
Numerical Methods for Computer Systems, Finite Element Methods
Undergraduate Coursework
Computer Programming, Data Structures, Computer Organization, Artificial Intelligence,
Analog and Digital Circuits, Operating Systems, Computer Networks, Introduction to Databases,
Discrete Mathematics, Signal Processing, Principles of Programming Languages, Programming
Language Processors, Theory of Computation, Algorithms, Software Engineering.
Major Projects
BUDDIES - Bus Driven Data Dissemination System
(Jan 2003 – April 2004)
Project Guide: Dr. Kamalakar Karlapalem, Dr. P. Krishna Reddy
Recently, with the dramatic improvements in global interconnectivity due to the Internet,
the web community has been witnessing a boom in the number and variety of data-intensive
applications. However, a simultaneous explosion in the user base for these applications has
given rise to major scalability and reliability issues with respect to the network as well as
the servers that host these applications. To address these issues, past research has yielded
a number of systems, such as those based on server-side scheduling, differentiated services
and publish-subscribe. BUDDIES was developed with a similar aim in mind. BUDDIES combines
the ideas of pull-based data dissemination with those of push-based data dissemination,
content-based routing, multicast and QoS. Comprehensive experimentation has demonstrated
that BUDDIES reduces network traffic and server-side computation at the cost of a reasonable
increase in response time. Preliminary results were compiled into a paper that won the Best
Poster Presentation award at the International Conference on High Performance Computing
(HiPC) - 2003.
Design and Implementation of Visualization modules for the Indic Data Miner
(Data Mining toolkit)
(Spring 2004)
Project Guide: Dr. Kamalakar Karlapalem
The project involved developing the visualization modules for the Clustering and
Classification algorithms in Indic Data Miner, a Data Mining toolkit that is being
developed at the CDE. The aim of the Indic Data Miner is to support cutting-edge
algorithms for association rules, clustering and classification. I also implemented the
BIRCH clustering algorithm and integrated it with the toolkit.
Course Projects
Design and Development of a Distributed Database System
Course: Distributed Database Management Systems (Fall 2002)
Instructor: Dr. Kamalakar Karlapalem
The project involved building a Distributed Database Management System on top of the
MySQL DBMS to handle distributed query processing, query optimization and transaction
management, with fragmentation transparency. The DDBMS was designed to handle simple
select, insert, update and delete SQL queries.
A data mining exercise using WEKA, a Data Mining toolkit
Course: Data Warehousing and Data Mining (Spring 2003)
Instructor: Dr. Kamalakar Karlapalem
The project involved extracting useful information from a real-life tennis dataset using
WEKA, a toolkit providing a collection of machine learning algorithms for data mining
tasks. It included the application of data pre-processing, classification, clustering and
association-rule mining algorithms. The project revealed how much thought and effort
go into data mining tasks such as data cleaning, choosing an appropriate data mining
algorithm (depending on the nature of the data) and interpreting the final results.
Design and Development of a Database System
Course: Introduction to Database Management Systems (Spring 2002)
Instructor: Dr. Kamalakar Karlapalem
The project involved planning and designing a Relational Database System using
MySQL for the Forest Department of Andhra Pradesh (County). The objective of the
project was to gain hands-on experience with the various stages of building a relational
database for a real-world environment: designing the ER Model, converting it to a
Relational Model, normalizing it and finally developing end-user applications.
Term Paper on Genetic Algorithms and Genetic Programming
Course: Machine Learning (Fall 2003)
Instructor: Dr. C.V. Jawahar
The need to create extremely robust and flexible systems is now driving many
researchers to explore the logic behind the most robust system of all, Nature. Recent
years have seen a growing interest in the study of methods such as Neural Networks,
Genetic Algorithms and Simulated Annealing, which are inspired by biological processes
and are believed to be highly general and capable of solving problems about which little
is known. The aim of this project was to understand Genetic Algorithms and Genetic
Programming and to get a fair idea of the ongoing research in these areas.
Conferences attended
10th International Conference on High Performance Computing (HiPC 2003),
Hyderabad, INDIA.