Curriculum Vitaé
[pdf version]

Harish Doddi


Tel: +1 650 391 8164
harishd@stanford.edu
http://cs.stanford.edu/~harishd


EDUCATION

September 2008-March 2010(Expected) Masters in Computer Science & Engineering
Stanford University, California.
2004 - 2008 Bachelor of Techonology (Hons) in Computer Science & Engineering
International Institute of Information Technology, Hyderabad(IIIT, Hyderabad), India. CGPA 9.14/10

COURSE WORK AT STANFORD

cs 140 (Operating Systems)
cs 276 (Information Retrieval and Web Search)
cs 245 (Database Systems Design)

cs 345A (Data Mining)

cs 244B (Distributed Systems)

cs 240 (Advanced topics in Operating Systems)

cs 347 (Transaction processing and Distributed Databases)

cs 271 (Principles of system design in Bio-Informatics)


SKILL SET

Operating Systems
Linux, Windows, MAC
GUI Design
QT
Programming
GNU/C, GNU/C++, STL, Java
Scripting
Python, Perl, Sed, Awk, Shell, XML/XSL
Scientific
Matlab
Web Design
HTML, CSS, PHP, CGI, Java Script, Ajax
Database
MySQL, MS-SQL, Postgres SQL


WORK EXPERIENCE

Internship Oracle Inc, Redwood Shores HQ, USA

Role : Software Development Engineer Intern

Group : Clusterware/RAC (Real Application Clusters)

Description :
As part of the internship, I worked with GIPC (Grid Inter Process Communication) layer which provided IP address abstraction to the communication layer in use. This provided high availability of the nodes in a cluster without limiting to the private network inter connects(NIC) which was in use before. The code has been included in the Oracle database 11.2g release.
Internship Amazon IDC, Hyderabad, INDIA

Role : Software Development Engineer Intern

Group : Transportation Core

Description :
As part of the internship, I developed a web GUI which involved dynamic editing based on the user given parameters. This tool helped many software engineers to make extensive use of the cgi and helped many newly joined engineers to get quickly adapted with the system.

RESEARCH EXPERIENCE

R.A Ship Research Assistant for NMBL (Neuromuscular Bio Mechanics Lab) under Prof Scott Delp

Description:
Development and Investigation of algorithms for Knee Image Segmentation of MRI Images. Presented my research work at NAMIC-AHM 2009 at MIT ( NAMIC Presentation )
R.A Ship Research Assistant for CEES (Stanford Center for Computational Earth and Environmental Science) under Prof Jerry M. Harris

Description:
Developed an application for enhancing Geological Data Manipulation.
R.A Ship Research Assistant for Esagu (a web-based personalized agro-advisory system which uses Information Technology to solve the unscientific agricultural practices.) under Prof Dr. P.K. Reddy

Description:
Developed Recommendation system for agricultural scientists in giving crop advices.

GRADUATE PROJECTS

Developed and Modified Kernel of Pintos Operating System
               Designed and Implemented the Priority Scheduler and Multilevel Feedback Queue Scheduler, System Call interface in Pintos Kernel, File System and Virtual Memory Management Unit for Pintos OS.

Distributed Replicated File system
               Designed and implemented protocol for distributed file system in which the files are replicated. The purpose was to explore a service specific protocol, relying on transactions for reliable delivery rather than conventional transport techniques.

Mazewar Game
               Developed and implemented protocol for Mazewar, a distributed multiplayer game that allows each player to control a rat in a maze and is based on the X Window System version of Mazewar.

MAJOR WORK

December 2007 - March 2008
Development of Cache for Distributed Database Management System
Dr. Kamal Karlapalem, CDE
A Semantic Cache has been built for a distributed database management system over several machines connected via LAN. The cache increased the performance of the system several times on an average and made the query execution efficient.

December 2007 - March 2008
Development of Distributed Database Management System
Dr. Kamal Karlapalem, CDE
This project aims at implementation of a distributed relational database management system over several machines connected via LAN. The DDBMS layer should support both read-only and update queries. The project proved that Distributed Database Query Processing time is efficient and reduced lot of network costs in transferring data.

Summer 2007
December 2006 - December 2007
Development of a multi-class text classifier on NASA dataset B-tech Project
A multi-class text classifier to categorize flight accidents data provided by NASA to help automate categorization of future flight accidents without any manual work. The major challenge in this project is that the dataset given to us is not complete. Certain data omissions have taken place in the middle for security purposes. So a normal person would not be able to understand the data. This will be further extended to work on other datasets like UIC dataset

Poster
November 2006 - December 2006
Implementation of "pText" datatype, an advanced version of Text Datatype CDE
"Unstructured" text always has the limitation of its inapplicability of data mining techniques on it. Though normal databases like Postgres provides "text" datatype, it doesnt satisfy the requirements of information retrieval today. In order to overcome this limitation, I designed and implemented an advanced version of text datatype in Postgres database

July 2006 - November 2006
A tool to summarize data distributed across machines Honors Project
A tool to summarize data distribution across different machines with different parameters like no of machines, size of dataset, load per machine, etc. It gives the user the most optimal distribution and demonstrates the idea by a graph. This project is funded by Google IDC, Hyderabad.

Poster
Summer 2006
An implementation of ACME on Language Data
Most of the classifier proposed performed extensive experiments to show their techniques to be both efficient and accurate. ACME ( An Associative Classifier based on Maximum Entropy) is a new classifier which is based on association rule mining. Our classifier rests on the maximum entropy principle for its statistical basis and does not assume any independence not inferred from the given dataset. An implementation of this has been tested on language data provided by Language Techonologies Research Center( LTRC ),IIIT Hyderabad.

Summer 2005
Post-processor for Indian Language OCRs, Dr. C.V.Jawahar, CVIT
A language independent post-processor for an OCR was developed and integrated with Indian language OCR system and tested in a team of three. The post-processor recieves as input the words as identified by the OCR and returns a ranked set of words that are most suitable to replace the given input if it is found to be erronous by using a dictionary. The post-processor has been tested with an Itrans plugin on two Indian languages(Malayalam and Telugu).

Project Page

COURSE PROJECTS

Information Extraction and Retrieval
This project involved the development of Named Entity Recogniser for Telugu,an Indian Language.

Software Engineering
A case study of a software development project for development of a tourist portal for France.

Web Data Knowledge Management
Implementation of paper "Combining link and Content Information in Web Search" by Matthew Richardson and Pedro Domingos.

DataWarehousing and Data Mining
Implemented the algorithm to find most frequent sequence followed by winners in a Chess Dataset.

Computer Graphics
Developed the game of tetris and implemented a graphics pipeline in OpenGl using only the in-built vertex(glVertex) primitive.

Theory of Computation
Implemented a convertor of Non-deterministic Finite State Automata to optimized Deterministic Finite state Automata.

Database Management Systems
Made a requirements analysis for a newspaper company to handle data of its classified advertisements department and designed an ER-model for Election Comission to handle database of various political parties in a decentralised system of governance and implemented a database for travel agency in MySQL to handle its resources and customers.

Operating Systems
Developed a shell with minimal functionalities and implemented a chat program using pipes.

IT-Workshop2
Developed Intranet Quotations website which has quotations classified based on various topics for browsing, a search engine and a shopping cart developed to grab any interesting quotes and a personlised recommendation system.


UNDERGRAD COURSES

Basic
Engineering Mathematics I and II, Programming, Physics I and II, Digital Logic and Design, Electronic Circuits, IT Workshop I,II and III and Electrical Science.

Core
Data Structures, Computer Organization, Algorithms, Database Management Systems, Operating Systems, Computer Networks, Theory of Computation, Principles of Programming Languages, Compilers and Software Engineering.

Advanced     
Artificial Intelligence, Linear Programming, DataWarehousing and Data Mining, Web Data Knowledge Management, Information Extraction and Retrieval

TEACHING EXPERIENCE

Operating Systems Teaching Assistant in the monsoon semester of 2006

Artificial Intelligence Teaching Assistant in the spring semester of 2006

ACADEMIC AWARDS AND HONORS

BIOGRAPHICAL




Abbreviations and Links

End of Curriculum Vitaé