projects
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionNext revisionBoth sides next revision | ||
projects [2011/09/14 18:19] – dymond | projects [2015/08/26 21:52] – jarek | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== | + | ====== |
- | (Listed in order received.) | + | \\ |
+ | =====Clustering High-Dimensional Data Sets===== | ||
- | ====== Building an autonomous motorboat ====== | + | **Supervisor: |
- | **Supervisor**: | + | Clustering is a basic technique for analyzing data sets. Clustering is the process of grouping data points in a way that points within a group are |
+ | more similar to each other than points in other clusters. Many clustering algorithms have been developed over the years. However no single algorithm works well for all data sets. Further, most clustering algorithms have running times of the order of n^2 or n^3, so that they are not feasible for data sets with hundreds of thousands of points. In this project we will design good clustering algorithms for large real data sets. In particular we are interested in | ||
+ | Biological data sets. | ||
- | **Required Background**: General CSE408x prerequisites | + | Our data sets will include those obtained from Flow Cytometry data. Flow Cytometry is a common technique in many areas of Biology, particularly Immunology. Typical usage involves testing a blood sample for 25 attributes on a per-cell basis, and thus typical data sets are arrays of 500,000 points in a 25 dimensional space. The aim is to identify clusters that correspond to a biologist' |
- | **Recommended Background**: | + | No Biology knowledge is required. The student should be a strong programmer. Knowledge of C/C++ is desirable but not essential. The work involves reading and understanding existing algorithms and working with the supervisor to design and implement improved algorithms and to measure the performance of the proposed algorithm(s). |
- | __Description__ | + | For more information, |
- | An opportunity exists for a small number of students | + | |
+ | Required Background: General CSE408x prerequisites | ||
- | ---- | ||
- | : | ||
- | ---- | ||
- | ====== Athenians Data Project ====== | ||
- | **Supervisor**: | ||
- | **Required Background**: | + | =====Metaheuristic-based Optimization techniques===== |
- | **Recommended Background**: Data Mining | + | **Supervisor: Suprakash Datta** |
- | __Description__ | + | Optimization |
- | The Athenians Project | + | |
- | Possible project ideas for this term span from simpler ones such as | + | |
- | how to present data in the best possible way, add spatial characteristics to existing data, | + | |
- | add multimedia data, improve text searching, etc. to more complex ideas such as filling | + | |
- | missing parts for the " | + | |
- | words has been done in the past using expert knowledge. Those experts have establish | + | |
- | certain | + | |
- | when talking | + | |
- | the database with some likelihood. Associating probabilities with hypotheses introduces | + | |
- | another opportunity | + | |
- | ---- | + | |
- | : | + | |
- | ---- | + | |
- | ====== Three-Dimensional Context from Linear Perspective for Video Surveillance Systems ====== | + | The student should be a strong programmer. A good grasp of algorithms and knowledge of C/C++ are desirable but not essential. The work involves reading and understanding existing algorithms and working with the supervisor to design and implement improved algorithms and to measure the performance of the proposed algorithm(s). |
- | **Supervisor**: | + | For more information, |
- | **Requirements**: Good facility with applied mathematics | + | Required Background: General CSE408x prerequisites |
- | __Description__ | + | =====Data visualization in Skydive===== |
- | To provide visual surveillance over a large environment, | + | **Supervisor: |
- | This problem can be addressed by automatically pre-mapping two-dimensional surveillance video data into three-dimensional coordinates. | + | Skydive is a prototype system designed for database visualization using a concept of the so called |
+ | data pyramid. The system is composed of three modules (DB - Database Module, D2I - | ||
+ | Data-to-Image module, and VC - Visualizaton Client). Each is designed to use a different type | ||
+ | of computer memory. The DB module uses disk to store and manage | ||
+ | data pyramids. The D2I module works with a small subset | ||
+ | and stores data in main memory (RAM). The VC module uses the graphic card’s capabilities to | ||
+ | perform | ||
+ | graphical representation of the data. | ||
+ | Currently the system support three presentation models implemented within the Visualization | ||
+ | Component, namely: | ||
- | Mapping surveillance video to three-dimensional coordinates requires construction of a virtual model of the three-dimensional scene. | + | • a 2D heat-map; |
- | This project will investigate | + | • a 2.5 D heat-map by 3D barchart; and |
- | Although the Manhattan world assumption provides powerful constraints, | + | • a 2.5 D terrain (by mesh and UV-mapping). |
- | The student will work closely with graduate students and postdoctoral fellows at York University, | + | The goal of the project is to implement two additional ways of data visualization |
+ | extend some of existing ones, that is: | ||
- | For more information on the laboratory: [[http:// | + | 1. Implement and test functions for data pyramid-based visualization of time series. |
- | ---- | + | 2. Implement functions for visualization based on cross-product of data pyramids. |
- | : | + | |
- | ---- | + | |
- | ====== Estimating Pedestrian | + | 3. Add support for specular |
- | **Supervisor**: James Elder | + | Required Background: CSE 3421, Java programming course, (C programming course a plus) |
- | **Requirements**: | ||
- | __Description__ | + | \\ |
+ | =====Genome-wide identification of plant micro RNAs===== | ||
- | Facilities planning at both city (e.g., Toronto) and institutional (e.g., York University) scales requires accurate data on the flow of people and vehicles throughout the environment. | ||
- | The density of permanent urban video surveillance camera installations has increased dramatically over the last several years. | + | **Supervisor: |
- | This project will explore the use of computer vision algorithms for the automatic estimation of pedestrian and vehicle flows from video surveillance data. The ultimate goal is to provide planners with accurate, continuous, up-to-date information on facility usage to help guide planning. | ||
- | The student will work closely with graduate students and postdoctoral fellows at York University, as well as researchers at other institutions involved | + | The Hudak Lab in the Biology Department has an opening for a fourth-year Honours student to assist with a bioinformatics |
- | For more information on the laboratory: [[http:// | + | Working with the support of a PhD student, your project will involve: |
- | + | ||
- | ---- | + | |
- | : | + | |
- | ---- | + | |
- | ====== Tandem repeat detection using spectral methods ====== | + | 1) Prediction of micro RNA (miRNA) targets on the basis of complementary sequence matches |
- | **Supervisor**: | + | 2) Correlation of miRNA and mRNA expression changes to identify genes that are regulated by miRNAs |
- | **Required Background**: | + | 3) Conducting pathway analysis to determine which biological processes are controlled by miRNAs |
- | **Recommended Background**: | + | 4) Construction of a miRNA/ |
+ | This work will contribute to a scientific manuscript on miRNA-mediated gene regulation | ||
- | __Description__ | + | Requirements: |
- | DNA sequences of organisms have many repeated substrings. These are called repeats in Biology, and include both exact as well as approximate repeats. Repeats are of two main types: interspersed repeats (which are spread across a genome) and tandem repeats, which occur next to each other. Tandem repeats play important roles in gene regulation and are also used as markers that have several important uses, including human identity testing. | + | |
- | Finding tandem repeats is an important problem in Computational Biology. The techniques that have been proposed for it fall into two classes: string matching algorithms and signal processing techniques. In this project, we will explore fast, accurate algorithms for detecting tandem repeats and evaluate the outputs of the algorithms studied by comparing their outputs with those of available packages, including mreps (http:// | + | 1) Pre-requisites as per EECS Calendar |
- | The student will implement existing spectral algorithms based on Fourier Transforms and on an autoregressive model. He will then make changes suggested by the supervisor, and evaluate the effect of the modifications. Throughout the course, the student is required to maintain a course Web site to report any progress and details about the project. | + | 2) Facility with script-writing/ |
+ | 3) Preference for students with knowledge of statistics and familiarity with R programming | ||
+ | 4) Able to begin in September 2015 | ||
+ | Learning outcomes: | ||
- | ---- | + | 1) Manipulate and analyze quantitative biological data |
- | : | + | |
- | ---- | + | |
- | ====== Touch- | + | 2) Develop |
- | **Supervisor**: | + | 3) Manage a CentOS computer server to store and facilitate ongoing research |
- | **Required Background**: | + | No knowledge of biology is required. |
- | CSE3461 (or equivalent), | + | |
- | A student wishing to do this project must be well versed in Java, Eclipse, and developing java code for the Android operating system. | + | |
+ | For more information, | ||
+ | Hudak Lab website- http:// | ||
- | **Recommended Background**: | + | RNA sequencing- http:// |
- | Possession of an Android touch-based phone or tablet would be an asset, but is not essential. | + | |
- | __Description__ | + | miRNAs- http://en.wikipedia.org/ |
- | This project involves extending a touch-based text entry method to include automatic error correction. The method, as is, uses Graffiti strokes entered via a finger on a touch-based Android tablet. | + | |
+ | \\ | ||
+ | =====Dynamic Interface Detection and Control Project===== | ||
+ | **Supervisor: | ||
- | ---- | ||
- | : | ||
- | ---- | ||
- | ====== Early Breast Cancer Detection based on MRI’s. ====== | ||
- | **Supervisor**: | + | Contrary to most industries, fine chemical manufacturing is dominated by batch production methods. Increasing economic, environmental and safety pressures are motivating a turn towards continuous synthesis. Rather than making products in one big flask, continuous synthesis involves performing chemical reactions by flowing reagents through a tube. Working in this way provides more control over the reaction parameters leading to increases in product quality, and process efficiency and safety. The flow chemistry industry for fine chemical production is a relatively new but burgeoning field with a projected market capacity of billions of dollars by 2018. |
- | **Required Background**: | + | Extraction of the reaction mixture for purification and/or further processing is an important step in chemical manufacturing. This is a relatively straightforward operation in batch production, but offers several challenges for flowing processes. In order to facilitate continuous liquid extraction we require a sophisticated control system. This project involves designing, constructing and evaluating a pertinent practical problem in the field. |
- | **Recommended background**: | + | A key step in the process takes place in a clear tube that is mounted vertically. The tube contains two fluids with a boundary between them. During the process material flows into and out of the tube from the top and the bottom. Chemical reactions take place within this tube and It is essential that the position of the boundary be monitored as its position in the tube is used to control the flow of materials into the tube. |
- | Project Description: | + | One way of solving |
- | processing techniques for early detection | + | |
- | modalities. In particular, we propose to develop time reversal beamforming imager, | + | |
- | based on our earlier work in time reversal signal processing, for detecting early stage | + | |
- | breast cancer tumours from MRI data. | + | |
- | Our preliminary work has illustrated the type of | + | |
- | results that are possible for breast cancer detection by applying time reversal signal | + | |
- | processing on MRI breast data. In this research, we propose | + | |
- | provide | + | |
- | in MRI based breast cancer detection | + | |
- | a local hospital, and running our algorithms on these datasets. The first step is | + | |
- | important to check the validity | + | |
- | estimated locations of the tumours (as derived with our algorithms) | + | |
- | locations as identified by the pathologists. The second step will quantify the accuracy | + | |
- | of our estimation algorithms. | + | |
- | ---- | + | Specific goals of the project include: |
- | : | + | |
- | ---- | + | |
- | ====== Developing Fast Speech Recognition Engine using GPU ====== | + | |
- | **Supervisor**: | + | - Develop a computer vision system that can detect and monitor the interface between two miscible fluids of different density. |
- | **Required Background**: | + | - Evaluate the performance of the system over a range of different (and typical) fluids |
- | General prerequisites | + | |
+ | - Explore the use of different illuminant/ | ||
- | __Description__ | + | The successful candidate(s) will have the experience of working with a diverse group of scientists and engineers toward the design and implementation of an automated liquid extraction device with applications across many industries. Upon successful prototyping, |
- | Recently, Graphics Processing Units (GPU' | + | For further information please contact, |
- | During the recent years, there is an increasing demand in the job market for programmers who can use GPU's for general purpose computing tasks. This project will serve as a perfect vehicle for you to learn such a cutting-edge programming skill. | + | Michael Jenkin (jenkin@cse.yorku.ca) or Michal Organ (organ@yorku.ca) |
- | References | + | \\ |
+ | ====== DDoS Attack using Google-bots ====== | ||
- | [1] Kisun You, Jike Chong, Youngmin Yi, Gonina, E., Hughes, C.J., Yen-Kuang Chen, Wonyong Sung, Keutzer, K., " | + | **Supervisor**: Ntalija Vlajic |
- | [2] Jike Chong, Ekaterina Gonina, Youngmin Yi, Kurt Keutzer, "A Fully Data Parallel WFST-based Large Vocabulary Continuous Speech Recognition on a Graphics Processing Unit," Proc. of Interspeech 2009, Brigton, UK, 2009. | + | **Recommended Background**: |
- | ---- | + | |
- | : | + | |
- | ---- | + | |
- | ====== Solving Polynomials ====== | + | |
- | **Supervisor**: Mike McNamee | + | Not long ago, botnets - networks of compromised computers - were seen as |
+ | the most effective (if not the only) means of conducting Distributed Denial | ||
+ | of Service (DDoS) attacks. However, with the growing popularity and prevalence | ||
+ | of application-layer over other types of DDoS attacks, the DDoS execution | ||
+ | landscape is becoming increasingly more diverse. An especially interesting | ||
+ | new trend is the execution of application-layer DDoS attacks by means of | ||
+ | skillfully manipulated Web-crawlers, | ||
+ | The goal of this project is to design, implement and test a real-world | ||
+ | framework consisting of the following: a) the attacker' | ||
+ | domain specially designed to attract Google-bots and then manipulate them | ||
+ | into generating attack traffic towards the target/ | ||
+ | victim' | ||
+ | component, the project will also look into the statistical/ | ||
+ | estimation of the framework' | ||
+ | to an actual (real-world) target/ | ||
- | **Required Background**: | ||
- | General prerequisites plus course in Numerical Methods, and knowledge of programming, | ||
- | __Description__ | ||
- | In this project you will compare several efficient methods for solving polynomials. | + | \\ |
- | ---- | + | ====== |
- | : | + | |
- | ---- | + | |
- | ====== | + | |
- | **Supervisor**: | + | **Supervisor**: |
- | **Required Background**: | + | **Required Background**: |
- | Some knowledge of microprocessor architecture | + | good math skills, |
- | __Description__ | + | One of the challenges in remote learning is to allow students to communicate effectively with the lecturer. |
+ | |||
+ | The goal of this project is to apply attentive sensing technology (www.elderlab.yorku.ca) to this problem. | ||
+ | |||
+ | In particular, working with a senior graduate student or postdoctoral fellow, the successful applicant will: | ||
+ | |||
+ | - Study the problem of detecting hand-raises in the preattentive sensor stream | ||
+ | - Implement algorithms for detecting hand-raises based upon this investigation | ||
+ | - Evaluate these algorithms in a real-classroom setting, using proprietary attentive sensing technology | ||
- | Every microprocessor is supported by a variety of software tools, such as assemblers, disassemblers, | ||
- | Background Information: | + | ====== Attentive Sensing for Sport Video Recording Markets ====== |
- | manufactured in Canada (by Microsystems International Ltd, or MIL) and one of the earliest microprocessors ever produced. The microprocessor was used, among other applications as the CPU of the CPS-1 microcomputer. Although none of the CPS/1 computers (nor MF7114 software) have survived, technical information about the microprocessor and the CPS-1 has been preserved. This makes the design and | + | |
- | implementation of an assembler possible. More information on | + | |
- | http:// | + | **Supervisor**: James Elder |
- | ---- | + | **Required Background**: Good programming skills; Good math skills; Knowledge of C and MATLAB programming languages |
- | : | + | |
- | ---- | + | |
- | ====== MF7114 Debugger ====== | + | |
- | **Supervisor**: | + | |
- | + | The goal of this project | |
- | **Required Background**: | + | |
- | Some knowledge | + | The general problem |
- | + | ||
- | __Description__ | + | The student will work with senior graduate students, postdoctoral fellows |
- | + | ||
- | Every microprocessor | + | 1. Ground-truth available datasets |
- | + | 2. Evaluate current attentive algorithms | |
- | Background Information: | + | 3. Modify these algorithms to improve performance on these datasets |
- | manufactured in Canada (by Microsystems International Ltd, or MIL) and one of the earliest microprocessors ever produced. The microprocessor was used, among other applications as the CPU of the CPS-1 microcomputer. Although none of the CPS/1 computers (nor MF7114 software) have survived, technical information about the microprocessor and the CPS-1 has been preserved. This makes the design and | + | |
- | implementation of a debugger possible. More information | + | ------------ |
- | + | ||
- | http://www.cse.yorku.ca/ | + | |
- | + | ||
- | ---- | + | |
- | : | + | |
- | ---- | + | |
- | + | ||
- | ======Web Crawlers Behaving Like Humans: Are We There Yet!? ====== | + | |
- | + | ||
- | **Supervisor**: | + | |
- | + | ||
- | **Required Background**: | + | |
+ | \\ | ||
+ | ====== JPF in a Jar ====== | ||
- | __Description__ | + | **Supervisor: |
+ | Description: | ||
+ | JPF, which is short for Java PathFinder, is an open source | ||
+ | tool that has been developed at NASA's Ames Research Center. | ||
+ | The aim of JPF is to find bugs in Java code. Instead of | ||
+ | using testing to find those bugs, JPF uses model checking. | ||
+ | The facts that JPF is downloaded hundreds of times per month | ||
+ | and that some of the key papers on JPF have been cited more | ||
+ | than a thousand times reflect the popularity of JPF. In | ||
+ | fact it is the most popular model checker for Java. | ||
- | Distributed Denial of Service (DDoS) attacks are recognized as one | + | A study done by Cambridge University in 2014 found that the |
- | of the most serious threats to today' | + | global cost of debugging code has risen to $312 billion annually. |
- | simplicity | + | Furthermore, on average software developers spend 50% of their |
- | the quality at which Web-based services are offered to the end users. | + | programming time with finding |
- | An especially challenging form of DDoS attacks are the so-called | + | advocating |
- | Application-Layer DDoS attacks. Namely: | + | |
- | 1) In Application-Layer DDoS attacks, the attackers utilize a flood | + | |
- | of legitimate-looking Layer-7 network sessions (i.e., sessions that | + | |
- | are generally hard to detect | + | |
- | IDS system); | + | |
- | 2) Increasingly, these sessions comprise HTML requests generated by a | + | |
- | cleverly programmed crawler that executes a semi-random walk over the | + | |
- | web site links, thereby attempting to appear | + | |
- | visitor. | + | |
- | The goal of this project is to investigate the state of the art in | + | Installing JPF is far from trivial. |
- | malicious web crawler design. In particular, the project will look | + | implemented |
- | into the challenges of designing | + | feasible to encapsulate JPF in a Java archive (jar) file. |
- | attacker point of view - one of these challenges being the estimation | + | This would make it significantly simplifying |
- | of web-page popularity assuming no a priori access | + | process |
- | of the victim web-site. | + | accessible |
- | ---- | + | |
- | : | + | |
- | ---- | + | |
- | ====== GFI Sandbox | + | |
- | **Supervisor**: | + | The aim of this project is to attempt to put JPF in a jar. |
+ | Since JPF relies on a number of configuration files, so-called | ||
+ | Java properties files, incorporating these properly into the | ||
+ | jar is one of the challenges. | ||
+ | another challenge. | ||
+ | our modifications to JPF should ideally be limited to only a | ||
+ | few classes, yet another challenge. | ||
- | **Required Background**: | + | In this project you may collaborate with graduate students |
+ | of the DisCoVeri group (discoveri.eecs.yorku.ca) and | ||
+ | computer scientists of NASA. For more information, | ||
+ | free to send email to franck@cse.yorku.ca. | ||
+ | **Required Background: | ||
+ | \\ | ||
+ | ------------ | ||
+ | \\ | ||
- | __Description__ | ||
- | GFI Sandbox is a sophisticated industry-leading tool for quick and | ||
- | safe analysis of malware behaviour. The goals of this project are: | ||
- | 1) familiarize yourself with the operation of GFI Sandbox; | ||
- | 2) using readily available GFI Sandbox Feeds (i.e., ThreatTrack Feeds), | ||
- | build a database of malware designed specifically for execution of | ||
- | DDoS-attacks - the so-called botnet malware; | ||
- | 3) examine the behaviour of the collected malware 'upon execution'; | ||
- | 4) propose and build an environment - comprising the standard freeware | ||
- | security tools - for longer term (beyond immediate execution) analysis | ||
- | of the collected malware. | ||
- | ---- | ||
- | : | ||
- | ---- | ||
- | Network analysis of EEG data: Understanding connections in the brain | ||
- | ====== | + | ====== |
- | **Supervisor**: Andrew Eckford | + | **Supervisor:** Zhen Ming (Jack) Jiang (zmjiang at cse dot yorku dot ca) |
- | **Required Background**: CSE 3213 (Computer Networks), CSE 3451 (Signals | + | **Required Background:** Good programming skills in Java; Good analytical |
- | **Preferred**: At least a B in all of the above courses | + | **Short Description: |
+ | \\ | ||
+ | ------------------ | ||
+ | \\ | ||
- | __Description__ | ||
- | Electroencephalogram (EEG) data indicates electrical activity at particular locations in the brain. Using EEG data from multiple sensors, it is possible to find correlations among the measurements, | ||
- | The tools that are used to analyze communication networks can also be used to analyze brain networks. In this interdisciplinary project, you will work with a collection of EEG data to identify correlated measurements, | ||
+ | ======Model-based Design and Development of Embedded Systems with Code Generation Tools====== | ||
- | ---- | + | **Supervisor:** Jia Xu |
- | : | + | |
- | ---- | + | |
- | ====== An Open Source Structural Equation Modeling Path Diagram to Syntax Application ====== | + | **Required Background: |
+ | such as Arduino. | ||
- | **Supervisor**: Jeff Edmonds | + | **Project Description: |
- | **Required Background**: | + | Model-based design with code generation tools can be used for simulation, rapid prototyping, |
- | **Recommended Background**: | + | \\ |
+ | ------------------ | ||
+ | \\ | ||
- | __Description__ | + | ======C2000 Concerto Microcontrollers====== |
- | The software required is an application that allows researchers to define their hypothesized models visually and will output the correct syntax for the analytical software of their choosing. | + | |
- | To date a promising functional application has been developed in JAVA by a Computer Science student as a 4080 project. | + | **Supervisor: |
- | This a cross-disciplinary project with the Quantitative Methods division of the Department of Psychology. As such, the student will be working with individuals with expertise | + | **Required Background: |
+ | strong C programming skills, solid knowledge | ||
+ | **Description: | ||
+ | two cores on a single-chip with on-chip low latency interprocessor communication between the two cores: a C28x 32-bit control core for | ||
+ | real-time control with faster/more loops and small sampling window; | ||
+ | and an ARM 32-bit Cortex-M3 host core for communications and general purpose. The selected student will evaluate the capabilities of the | ||
+ | C2000 Concerto family of microcontrollers through testing and investigating open source software for real-time control applications | ||
+ | that runs on C2000 Concerto Microcontrollers. | ||
- | ---- | + | \\ |
- | : | + | ------------------ |
- | ---- | + | \\ |
- | ====== | + | ======Real-Time Bidding Platform====== |
- | **Supervisor**: Rob Allison (co-supervised with Hugh Chesser, Space Engineering) | + | **Supervisor:** Jia Xu |
- | **Required Background**: General CSE408x prerequisites, familiarity with C++ and Windows software tools | + | **Required Background:** At least a B+ in Operating System Fundamentals |
+ | (CSE3221), strong Ubuntu/ | ||
+ | **Description: | ||
- | __Description__ | + | \\ |
- | The YUsend (York University Space Engineering Nanosatellite Demonstration) Lab has procured a Windows XP-based industrial computer and temperature acquisition card (as well as other hardware) for performing TVAC testing of nanosatellites in the CSIL Lab (PSE 003). A "TVAC Test Manager" | + | ------------------ |
+ | \\ | ||
- | Specific tasks include: | + | ======Circuit |
- | 1. Writing temperature acquisition card (OMEGA Engineering CIO-DAS-Temp) drivers for LabView - should be written in Visual C++ or similar | + | |
- | 2. Write LabView VI's (" | + | |
- | 3. (Optional) Interface the Test Manager with an orbital simulation tool which would be used to compute IR lamp inputs based on a simulation of the nanosatellite' | + | |
+ | **Supervisor: | ||
+ | **Description: | ||
- | ---- | + | **Required Background** A background in undergraduate-level electronics is very important. |
- | : | + | |
- | ---- | + | |
+ | \\ | ||
+ | ------------------ | ||
+ | \\ | ||
+ | More project proposals may be added here in the first week of the winter term. | ||
+ | \\ | ||
+ | ------------------ | ||
+ | \\ | ||
+ | \\ |
projects.txt · Last modified: 2016/01/13 20:05 by stevenc