The MultiMedia System

The MultiMedia system stores and manipulates patient data in the Oncology department of the Soroka hospital. The database contains various types of multi-media information including textual, images and voice.

The Structure of the MultiMedia System.

The MultiMedia system consists of five layors: Interface Layer, Intermediate Layer, MultiMedia algorithms layer, Database server layer, database layer.

MultiMedia Algorithms.

Speech Keyword processing.

The project consists of two parts: adding new keyword to the database and recognition of the existing keywords in speech files.
  1. Adding new keyword to the database includes the following steps:
  2. Recognition of a keyword in a speech file is performed in the following way:
    For each check window in the speech file competition between all models in the join model of the keyword is made. The probability of fitness of each model to the check window is calculated. If some keyword model has the greatest propability then it is compared with the threshold. (that propability can be stored in the database for future reference.) If the probability is greater that the threshold than the check window contains the keyword.

Image processing and Tumor identification.

There are two main tasks that are achieved in this project: image processing and image comparisons.
  1. Image Processing allows the following actions:
  2. Image Comparisons are performed in the following way:
    A user selects a tumor in an image and the system automatically finds the most similar tumors in a set of images. The finding algorithm gets parameters of the selected tumor and searches for tumors with the same parameters (up to an epsilon, specified by the user).The tumors are determined by the algorithm automatically, without specifying the start rectangle. The images, where the similar tumors were found, are ordered with the most fit image first. The user can load each image and view the found tumors.

Texture-based Image Segmentation.

There are the following options in the project:
  1. Perform a segmentation on an image.
    The result of a segmentation is a list of regions, where a region is characterized by a contour and a parameterized representation of the texture within the contour. Parameters of segments are stored in the database.
    Two kinds of segmentation are provided by the project:
  2. Given a segmentation of an image it is possible to perform the following actions:
  3. Image reconstruction: from a given indexed representation create an image, including the combination of texture and non-texture regions.
    Reconstruction is based on rectangles that bind different segments. Each rectangular is presented by a matrix where each entry contains a color of a pixel, if the pixel is inside the segment, or a negative value, if the pixel is outside the segment. These matrixes are merged into one large matrix of the image in a way that only positive values (with colors of pixels) are copied from small matrixes to the large matrix. The large matrix presents the reconstructed image.
  4. Presentation of parameters of all segments in an image.

Image Compression.

The compression/decompression algorithm operates on rectangles with sides which are power of 2 pixels. The interface allows to divide an image into several rectangles and define a different compression rate for each such rectangle. The process of compression generates one compressed file for one rectangle. And the process of decompression joins all compressed files (i.e. the corresponding rectangles).

General MultiMedia query

The general query module uses techniques for evaluating and combining uncertain information which have discussed by Fagin R. in "Fuzzy queries in Multi-media database systems" (Proceedings PODS, 1998, pp 1-10).
There are three types of data: text, image and voice. Possible queries for each of these types are:
Results of each query are graded sets. Each patient is assigned a grade of satisfaction to the query. A grade is a real number in the interval [0,1], which identifies the fitness of a patient to the query. All patients that satisfy a text query are assigned grades equal to 1. The results of the image and voice queries are lists of patients, sorted by their grade, where the larger the grade is, the better the match is.
A grade of a patient that satisfies image (voice) query, is taken to be the maximum grade of all images (voice files) of this patient, that contain a similar tumor (a keyword). The reason is that to characterize a patient the best image (voice file) is taken.
Now let's consider a general query which is the conjunction of these kinds of queries. The result of the conjunction contains the patients that are in the results of all these queries. Each such patient gets a grade, which is the minimum of the grades of this patient in all queries of the conjunction. Therefore, the result of the general query is the ordered list of patients, sorted by the resulted grades of these patients.