Adam Novozámský - postdoc in Image Processing

2021-2022
Computer Vision Lab
Faculty of Informatics, Institute of Visual Computing & Human-Centered Technology
TU Wien

Postdoc position
Project: Exploration of the Vienna City Library Poster Collection using Computer Vision Approaches
The project has been funded by the Wienbibliothek im Rathaus, Magistrat der Stadt Wien, MA9.

2010-2018
Department of Mathematics
Faculty of Nuclear Science and Physical Engineering
Czech Technical University in Prague

Training workplace: Institute of Information Theory and Automation
The Czech Academy of Sciences

PhD degree in Applications of Natural Sciences - Mathematical Engineering
Doctoral thesis: Selected Application Areas Of Image Processing: Image Forensics And Medical Imaging
[ PhD thesis ] [ presentation ] [ co-authors statement ]

2008-2010
Department of Physical Electronics
Faculty of Nuclear Science and Physical Engineering
Czech Technical University in Prague

Institute of Photonics and Electronics
The Czech Academy of Sciences

MSc degree in Applications of Natural Sciences - Engineering Informatics
Master thesis: Software for Tomographic Reconstruction of Refractive Index Profile of Special Optical Fiber Preforms
[ MSc thesis ] [ presentation ]

2003-2008
Department of Physical Electronics
Faculty of Nuclear Science and Physical Engineering
Czech Technical University in Prague

BSc degree in Applications of Natural Sciences - Engineering Informatics
Bachelor thesis: The Electronic Guard of Physically Handicapped Persons
[ BSc thesis ]

Image Forensics is very closely related to the history of analog photography. One of the first tampered images originated a few decades after Niépce created the first photographs in 1826. A nice example of analog image splicing was created around 1864, during the American Civil War.
Tampering with images is easy with today's software tools for Image Processing. A forger can add or remove important information, which completely changes the message of a processed photo. The integrity of visual data is important for the credibility of news media, and especially when it is used as evidence in court or during criminal investigations. For this reason, we have observed a dynamic development in this research area in recent years. Over time, two branches of forensic analysis of digital images and videos have become established as essential. One of them is integrity verification (authenticity analysis), which determines whether material has been added, altered, or deleted in the image. The second one is image ballistics (source device verification), which identifies the device that acquired the image.
Although past research in these areas has mainly focused on data hiding and digital watermarking approaches, today there is a growing alternative approach called the passive one, which does not need to embed any secondary data into the image.
Methods published in this research area have, for example, attempted to detect image splicing, traces of inconsistencies in color filter array interpolation, traces of geometric transformations, cloning, computer graphics generated photos, JPEG compression inconsistencies, file structure inconsistencies, etc. Typically, these methods are based on the fact that digital image editing brings specific detectable statistical or geometrical changes into the image. Others are looking for a distortion in the reality of the image scene.
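To make the idea of "detectable changes" concrete, here is a deliberately naive sketch of cloning (copy-move) detection. It is not one of the published methods mentioned above: it simply hashes every small block and reports blocks that occur twice at a sufficient distance, so it only works when the copied pixels are bit-identical (no recompression, resizing, or noise). The function names, the block size, and the synthetic demo image are assumptions made for illustration.

```python
import numpy as np
from collections import defaultdict


def find_cloned_blocks(image, block=8, min_shift=8):
    """Return pairs of top-left corners whose block x block patches are identical.

    image: 2-D uint8 array (grayscale). Exact matching only; illustrative, not robust.
    """
    height, width = image.shape
    seen = defaultdict(list)
    for y in range(height - block + 1):
        for x in range(width - block + 1):
            patch = image[y:y + block, x:x + block]
            # Flat patches (e.g. sky) match everywhere and carry no evidence.
            if patch.min() == patch.max():
                continue
            seen[patch.tobytes()].append((y, x))

    pairs = []
    for positions in seen.values():
        for i in range(len(positions)):
            for j in range(i + 1, len(positions)):
                (y1, x1), (y2, x2) = positions[i], positions[j]
                # Ignore matches that are too close to be a meaningful copy.
                if max(abs(y1 - y2), abs(x1 - x2)) >= min_shift:
                    pairs.append((positions[i], positions[j]))
    return pairs


def make_forged_demo(seed=0):
    """Hypothetical test image: random texture with one 16 x 16 region cloned."""
    rng = np.random.default_rng(seed)
    img = rng.integers(0, 256, size=(64, 64), dtype=np.uint8)
    img[40:56, 40:56] = img[4:20, 4:20]
    return img


if __name__ == "__main__":
    matches = find_cloned_blocks(make_forged_demo())
    shifts = {(b[0] - a[0], b[1] - a[1]) for a, b in matches}
    print(len(matches), "matching block pairs, shifts:", shifts)
```

Practical detectors replace the exact hash with robust features and must cope with JPEG recompression, which is exactly where such naive matching breaks down.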

The contribution of our department to this area of research is covered by these papers:
[Ministry of Interior : VG20102013064] Pizzaro: Forensic analysis and restoration of image and video data
[H2020-EU.2.1.1. : 825227] IMD2020: A Large-Scale Annotated Dataset Tailored for Detecting Manipulated Images

I have been deeply immersed in Image Processing since 2010, and I still enjoy it because each problem has its solution.

There is always something new to discover.

Real-Time Wheel Detection and Rim Classification in Automotive Production

2023 IEEE International Conference on Image Processing (ICIP), p. 1410-1414

This paper proposes a novel approach to real-time automatic rim detection, classification, and inspection by combining traditional computer vision and deep learning techniques. At the end of every automotive assembly line, a quality control process is carried out to identify any potential defects in the produced cars. Common yet hazardous defects are related, for example, to incorrectly mounted rims. Routine inspections are mostly conducted by human workers that are negatively affected by factors such as fatigue or distraction. We have designed a new prototype to validate whether all four wheels on a single car match in size and type. Additionally, we present three comprehensive open-source databases, CWD1500, WHEEL22, and RB600, for wheel, rim, and bolt detection, as well as rim classification, which are free-to-use for scientific purposes.

[ PDF ] [ BibTeX ] [ Dataset ] [ IEEE ] [ ICIP2023 ]

NeRD: Neural Field-Based Demosaicking

2023 IEEE International Conference on Image Processing (ICIP), p. 1735-1739

We introduce NeRD, a new demosaicking method for generating full-color images from Bayer patterns. Our approach leverages advancements in neural fields to perform demosaicking by representing an image as a coordinate-based neural network with sine activation functions. The inputs to the network are spatial coordinates and a low-resolution Bayer pattern, while the outputs are the corresponding RGB values. An encoder network, which is a blend of ResNet and U-net, enhances the implicit neural representation of the image to improve its quality and ensure spatial consistency through prior learning. Our experimental results demonstrate that NeRD outperforms traditional and state-of-the-art CNN-based methods and significantly closes the gap to transformer-based methods.
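As a rough illustration of the "coordinate-based neural network with sine activation functions" mentioned in the abstract, the sketch below runs an untrained forward pass that maps pixel coordinates to RGB values. It is not the NeRD architecture: the Bayer-pattern input and the encoder are omitted, and the layer sizes, the frequency scale `w0 = 30`, the initialisation bounds, and all function names are assumptions chosen for illustration.

```python
import numpy as np


def init_sine_layer(rng, n_in, n_out, w0=30.0, first=False):
    """Random weights for one layer; the bounds are an assumed, commonly used choice."""
    bound = 1.0 / n_in if first else np.sqrt(6.0 / n_in) / w0
    weights = rng.uniform(-bound, bound, size=(n_in, n_out))
    bias = np.zeros(n_out)
    return weights, bias


def build_field(rng, hidden=32, depth=3, w0=30.0):
    """Layers of a small coordinate network: 2 inputs (x, y) -> 3 outputs (R, G, B)."""
    layers = [init_sine_layer(rng, 2, hidden, w0, first=True)]
    for _ in range(depth - 1):
        layers.append(init_sine_layer(rng, hidden, hidden, w0))
    layers.append(init_sine_layer(rng, hidden, 3, w0))  # final linear layer
    return layers


def neural_field(coords, layers, w0=30.0):
    """Evaluate the network at coords of shape (N, 2); returns an (N, 3) array."""
    h = coords
    for weights, bias in layers[:-1]:
        h = np.sin(w0 * (h @ weights + bias))  # sine activation
    weights, bias = layers[-1]
    return h @ weights + bias


def pixel_grid(height, width):
    """Pixel-centre coordinates normalised to [-1, 1], one (x, y) row per pixel."""
    ys = np.linspace(-1.0, 1.0, height)
    xs = np.linspace(-1.0, 1.0, width)
    yy, xx = np.meshgrid(ys, xs, indexing="ij")
    return np.stack([xx.ravel(), yy.ravel()], axis=1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    rgb = neural_field(pixel_grid(4, 6), build_field(rng))
    print(rgb.shape)
```

Because the image is a function of continuous coordinates, it can be queried at any location, including the positions where a Bayer sensor recorded only one of the three colour channels.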

[ PDF ] [ BibTeX ] [ IEEE ] [ ICIP2023 ]

Exploration of the Vienna City Library Poster Collection using Computer Vision Approaches

The 2nd International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME), p. 3341-3345

Parts of the collections of libraries, archives, and museums may still be uncatalogued. Even if metadata is provided, only standardized information of the described resources (dependent on the collection) is available, i.e., creator names, titles, and subject terms, limiting the search options for experts and typical users. In the case of image-based collections, the information of the image itself can be used as an additional feature to extend the search capabilities of the user. This paper analyzes the use of standard computer vision methods to explore the Vienna City Library poster collection using additional image-based properties. The proposed exploration tool allows a search based on the provided metadata and features based on face detection and retrieval, image retrieval, object detection, text recognition, and main color similarity. The OpenSearch engine is used to index the metadata and visual features, allowing for a real-time search of extensive collections. The qualitative and quantitative analysis shows the potential of visual features within a search tool.

[ PDF ] [ BibTeX ] [ IEEE ] [ ICECCME2022 ]

Monitoring of Varroa Infestation Rate in Beehives: A Simple AI Approach

2022 IEEE International Conference on Image Processing (ICIP), p. 3341-3345

This paper addresses the monitoring of Varroa destructor infestation in Western honey bee colonies. We propose a simple approach using automatic image-based analysis of the fallout on beehive bottom boards. In contrast to the existing high-tech methods, our solution does not require extensive and expensive hardware components, just a standard smartphone. The described method has the potential to replace the time-consuming, inaccurate, and most common practice where the infestation level is evaluated manually. The underlying machine learning method combines a thresholding algorithm with a shallow CNN, VarroaNet. It provides a reliable estimate of the infestation level with a mean infestation level accuracy of 96.0% and 93.8% in the autumn and winter, respectively. Furthermore, we introduce the developed end-to-end system and its deployment into the online beekeeper’s diary, ProBee, that allows users to identify and track infestation levels on bee colonies.

[ PDF ] [ BibTeX ] [ IEEE ] [ ICIP2022 ]

Videokymogram Analyzer Tool: Human–computer comparison

Biomedical Signal Processing and Control, vol.78, September (2022), 103878

Videokymography (VKG) is a modern video recording technique used in laryngology and phoniatrics to examine vocal fold vibrations. To obtain quantitative information on the vocal fold vibration, VKG image analysis is needed but no software has yet been validated for this purpose. Here, we introduce a validated software tool that aids clinicians to evaluate diagnostically important vibration characteristics in VKG and other types of kymographic recordings. State-of-the-art methods for automated image evaluation were implemented and tested on a set of videokymograms with a wide range of vibratory characteristics, including healthy and pathologic voices. The automated image segmentation results were compared to manual segmentation results of six evaluators revealing average differences smaller than one pixel. Furthermore, the automatically categorized vibratory parameters precisely agreed with the average visual assessment in 84 and 91 percent of the cases for pathological and healthy patients, respectively. Based on these results, the newly developed software was found to be a valid, reliable automated tool for the quantification of vocal fold vibrations from VKG images, offering a number of novel features relevant for clinical practice.

[ PDF ] [ BibTeX ] [ Elsevier ]

Extended IMD2020: a large‐scale annotated dataset tailored for detecting manipulated images

IET Biometrics, vol.10, 4 (2021), p. 392–407

Image forensic datasets need to accommodate a complex diversity of systematic noise and intrinsic image artefacts to prevent any overfitting of learning methods to a small set of camera types or manipulation techniques. Such artefacts are created during the image acquisition as well as the manipulating process itself (e.g., noise due to sensors, interpolation artefacts, etc.). Here, the authors introduce three datasets. First, we identified the majority of camera models on the market. Then, we collected a dataset of 35,000 real images captured by these cameras. We also created the same number of digitally manipulated images. Additionally, we also collected a dataset of 2,000 ‘real-life’ (uncontrolled) manipulated images. They are made by unknown people and downloaded from the Internet. The real versions of these images are also provided. We also manually created binary masks localising the exact manipulated areas of these images. Moreover, we captured a set of 2,759 real images formed by 32 unique cameras (19 different camera models) in a controlled way by ourselves. Here, the processing history of all images is guaranteed. This set includes categorised images of uniform areas as well as natural images that can be used effectively for analysis of the sensor noise.

[ PDF ] [ BibTeX ] [ IET Biometrics ]

Automated Object Labeling For CNN-Based Image Segmentation

2020 IEEE International Conference on Image Processing (ICIP), p. 2036-2040

Deep learning-based methods for classification and segmentation require large training sets. Generating training data is often a tedious and expensive task. In industrial applications, such as automated visual inspection of products on an assembly line, objects for classification are well defined, yet labeled data are difficult to obtain. To alleviate the problem of manual labeling, we propose to train a convolutional neural network with an automatically generated training set using a naive classifier with handcrafted features. We show that when the naive classifier has high precision, the trained network has both high precision and high recall despite the low recall of the naive classifier. We demonstrate the proposed methodology on a real scenario of detecting a car coolant tank. Moreover, the proposed methodology facilitates the collection of training data for a wider range of CNN-based methods, such as near-duplicate image detection or segmenting tampered areas of images.

[ PDF ] [ BibTeX ] [ IEEE ] [ ICIP2020 ]

IMD2020: A Large-Scale Annotated Dataset Tailored for Detecting Manipulated Images

2020 IEEE Winter Applications of Computer Vision Workshops (WACVW), p. 71-80

Witnessing impressive results of deep nets in a number of computer vision problems, the image forensic community has begun to utilize them in the challenging domain of detecting manipulated visual content. One of the obstacles to replicating the success of deep nets here is the absence of diverse datasets tailored for training and testing of image forensic methods. Such datasets need to be designed to capture wide and complex types of systematic noise and intrinsic artifacts of images in order to avoid overfitting of learning methods to just a narrow set of camera types or types of manipulations. These artifacts are brought into visual content by various components of the image acquisition process as well as the manipulating process. In this paper, we introduce two novel datasets. First, we identified the majority of camera brands and models on the market, which resulted in 2,322 camera models. Then, we collected a dataset of 35,000 real images captured by these camera models. Moreover, we also created the same number of digitally manipulated images by using a large variety of core image manipulation methods as well as advanced ones such as GAN or Inpainting, resulting in a dataset of 70,000 images. In addition to this dataset, we also created a dataset of 2,000 “real-life” (uncontrolled) manipulated images. They are made by unknown people and downloaded from the Internet. The real versions of these images have also been found and are provided. We also manually created binary masks localizing the exact manipulated areas of these images. Both datasets are publicly available for the research community at http://staff.utia.cas.cz/novozada/db.

[ PDF ] [ BibTeX ] [ IEEE ] [ WACV2020 ]

Detection of Copy-move Image Modification Using JPEG Compression Model

Forensic Science International vol.283, 1 (2018), p. 47-57

The so-called copy-move forgery, based on copying an object and pasting in another location of the same image, is a common way to manipulate image content. In this paper, we address the problem of copy-move forgery detection in JPEG images. The main problem with JPEG compression is that the same pixels, after moving to a different position and storing in the JPEG format, have different values. The majority of existing algorithms is based on matching pairs of similar patches, which generates many false matches. In many cases they cannot be eliminated by postprocessing, causing the failure of detection. To overcome this problem, we derive a JPEG-based constraint that any pair of patches must satisfy to be considered a valid candidate and propose an efficient algorithm to verify the constraint. The constraint can be integrated into most existing methods. Experiments show significant improvement of detection, especially for difficult cases, such as small objects, objects covered by textureless areas and repeated patterns.

[ PDF ] [ BibTeX ] [ Elsevier ]

Automatic blood detection in capsule endoscopy video

Journal of Biomedical Optics vol.21, 12 (2016), p. 1-8

We propose two automatic methods for detecting bleeding in wireless capsule endoscopy videos of the small intestine. The first one uses solely the color information, whereas the second one incorporates assumptions about the blood spot shape and size. The main original idea is the definition of a new color space that provides good separability of blood pixels and intestinal wall. The two methods can be applied individually, or their results can be fused together for the final decision. We evaluate their individual performance and various fusion rules on real data, manually annotated by an endoscopist.

[ PDF ] [ BibTeX ] [ SPIE ]

PIZZARO: Forensic analysis and restoration of image and video data

Forensic Science International vol.264, 1 (2016), p. 153-166

This paper introduces a set of methods for image and video forensic analysis. They were designed to help to assess image and video credibility and origin and to restore and increase image quality by diminishing unwanted blur, noise, and other possible artifacts. The motivation came from the best practices used in criminal investigations utilizing images and/or videos. The determination of the image source, the verification of the image content, and image restoration were identified as the most important issues whose automation can facilitate criminalists' work. Novel theoretical results complemented with existing approaches (LCD re-capture detection and denoising) were implemented in the PIZZARO software tool, which consists of the image processing functionality as well as of reporting and archiving functions to ensure the repeatability of image analysis procedures and thus fulfills formal aspects of the image/video analysis work. A comparison of the newly proposed methods with state-of-the-art approaches is shown. Real use cases are presented, which illustrate the functionality of the developed methods and demonstrate their applicability in different situations. The use cases as well as the method design were solved in tight cooperation of scientists from the Institute of Criminalistics, National Drug Headquarters of the Criminal Police and Investigation Service of the Police of the Czech Republic, and image processing experts from the Czech Academy of Sciences.

[ PDF ] [ BibTeX ] [ Elsevier ]

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Abstract: This thesis will explore advanced image processing and machine learning techniques for analyzing biological data, with a focus on cell detection, segmentation, and tracking. Utilizing image datasets from the Institute of Biochemistry and Microbiology at VŠCHT, we will develop training datasets for algorithmic training and design specific metrics for algorithm evaluation. We will test and implement selected algorithms into tools for VŠCHT researchers. The effectiveness of these solutions will be assessed using annotated data, demonstrating significant potential for enhancing biological data analysis.

[2023 - 2024] MSc - Soňa Drocárová : Road Surface Condition Monitoring System

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Abstract: TBD

[2022 - 2024] BSc - Anel Mergaliyeva : Smart Robotic Arm Using Computer Vision

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Guidelines: Learn about programming a robotic arm and write simple scripts to verify its parameters, such as reaction time, range, speed, and movement accuracy. Based on the verified parameters, choose a two-player board game in which the robotic arm represents one of the opponents. Design the game space sensing system so that the control of the robotic arm can make decisions throughout the game based only on image data. Depending on the chosen board game, search for existing image information processing methods for game analysis and proper system response. Program the game logic for the selected board game. Design and implement the final system so that a human can play the game against the robotic arm and the arm is a comparable opponent to the human.

[2022 - 2024] MSc - Antonín Čech : Estimation and Tracking of the Human Pose with a Single RGB

Department of Software Engineering, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Guidelines: Learn about the problem of human location estimation using modern image recognition techniques. Based on this research, select several methods to use in your thesis. Download several freely available annotated datasets that you came across during your literature review. Next, create a set of unannotated videos for testing purposes. For the selected methods, compare the success and speed of the obtained data. Propose a method for tracking the people in the video. Implement a GUI to visualize each method for detecting and following people.

[ MSc thesis ] [ GitHub ] [ presentation - CZE ]

[2022 - 2023] BSc - Michal Průšek : Gesture-controlled drone

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Abstract: The subject of this work is gesture segmentation using classical image processing methods, namely thresholding, morphological transformations, histogram backprojection, and others. Subsequently, an approach to gesture classification based on the gesture contour and using Fourier descriptors is described. Finally, a comparison is made between the success rates of gesture classification using our proposed method and those based on a neural network model.
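For readers unfamiliar with Fourier descriptors, the following minimal sketch shows one common way to turn a closed contour into a feature vector that does not change under translation, scaling, rotation, or a different starting point. It is an illustration under stated assumptions, not the implementation from the thesis: the contour is assumed to be already extracted as an ordered array of boundary points, and the function name and the number of coefficients are hypothetical.

```python
import numpy as np


def fourier_descriptors(contour, n_coeffs=8):
    """Invariant shape features from an ordered closed contour of shape (N, 2).

    Requires N > 2 * n_coeffs. Returns 2 * n_coeffs non-negative values.
    """
    # Treat each boundary point (x, y) as the complex number x + iy.
    z = contour[:, 0] + 1j * contour[:, 1]
    spectrum = np.fft.fft(z)
    # Keep the low frequencies k = 1..n and k = -n..-1.
    # Skipping k = 0 removes the dependence on translation.
    idx = np.r_[1:n_coeffs + 1, -n_coeffs:0]
    # Magnitudes discard phase, which removes rotation and start-point effects.
    mags = np.abs(spectrum[idx])
    # Dividing by the largest magnitude removes the dependence on scale.
    return mags / mags.max()


def demo_contour(lobes, n_points=128):
    """Hypothetical test shape: a circle whose radius is modulated by `lobes` bumps."""
    t = np.linspace(0.0, 2.0 * np.pi, n_points, endpoint=False)
    r = 1.0 + 0.3 * np.cos(lobes * t)
    return np.stack([r * np.cos(t), r * np.sin(t)], axis=1)


if __name__ == "__main__":
    shape = demo_contour(3)
    angle = 0.7
    rot = np.array([[np.cos(angle), -np.sin(angle)],
                    [np.sin(angle), np.cos(angle)]])
    moved = np.roll(2.5 * shape @ rot.T + np.array([10.0, -4.0]), 17, axis=0)
    print(np.allclose(fourier_descriptors(shape), fourier_descriptors(moved)))
```

The resulting vectors can be fed to any standard classifier; note that these magnitudes are not invariant to reversing the direction in which the contour is traversed.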

[ BSc thesis - CZE ] [ GitHub - CZE ] [ dataset ] [ presentation - CZE ] [ video ]

[2021 - 2022] Research Task - Anna Gruberová : Optical Character Recognition on Scanned Historical Posters Using the State-of-the-Art Methods

Department of Software Engineering, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Abstract: Optical character recognition from image data is a demanding task in today’s world, because it is already impossible for humans to process a large amount of image data without automation. The Vienna Library owns over 350,000 digitized historical posters, from which the displayed text must be extracted. The aim of this work is to map existing text recognition methods and test them on selected datasets.

[ Research task ] [ presentation ]

[2021 - 2022] Research Task - Soňa Drocárová : Comparison of methods for unconstrained face detection

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Abstract: This thesis deals with face detection under unconstrained conditions. It aims to describe detection methods and select the appropriate approaches as well as datasets for testing them. The testing is done with the help of two publicly available collections (WFLW, CelebA) and one set created for the purposes of this thesis by labeling data from the Vienna City Library. The performance of these algorithms in less ideal conditions was also tested on more challenging subsets of the CelebA and WFLW datasets.

[ Research task ] [ presentation ]

[2021 - 2022] MSc - Adéla Kostelecká : Content-Based Image Retrieval: from Primitive to Advanced Techniques

Department of Algebra, Faculty of Mathematics and Physics, Charles University

Abstract: The Wienbibliothek im Rathaus, Vienna City Library, collected over 300 thousand posters scanned in high quality from the last 100 years. Browsing and searching in such a large dataset is beyond human power. Therefore, a project was set up in cooperation with the Technical University of Vienna to test the possibilities of automatic data annotation on a selected sample. One of the requirements was Content-based Image Retrieval - retrieving images based on their visual content. This thesis reviews these techniques that emerged over the last decades. We focus on simple techniques based on colour, texture, and shape, as well as more advanced algorithms using convolutional neural networks. We implement these methods and compare their retrieval effectiveness on particular image datasets. Finally, we describe the functionality of a developed web application.

[ MSc thesis ] [ presentation ]

[2020 - 2022] MSc - Roman Staněk : System for automatic size and type control of cars rims

Department of Software and Computer Science Education, Faculty of Mathematics and Physics, Charles University

Abstract: At the end of every automotive assembly line, there is a quality control process where factory workers check produced cars for potential defects. The computer vision field, especially neural networks for images, has great potential to complement human staff in order to produce cars that are as safe and reliable as possible. In this thesis we focus on validating whether all four wheels on a single car match in size and type. We introduce and experiment with both neural networks and traditional computer vision techniques. The approach we use is to first detect the car, then classify its wheels and try to estimate their size. In the end we build a functional prototype of the system that runs in real time. The data for this thesis were recorded in the Škoda Auto factory in Mladá Boleslav in cooperation with the company.

[ MSc thesis ] [ presentation ]

[2020 - 2021] BSc - Markéta Machalová : Detection of Varroa destructor using computer vision

Department of Algebra, Faculty of Mathematics and Physics, Charles University

Abstract: The bachelor thesis deals with the design and description of a tool that can simplify the process of monitoring varroosis. Using image processing methods, it detects individual mites from the picture of the bottom board in the beehive. For better image resolution we used several images from one bottom board. Those images were then stitched into one image using image registration. In the first part we focused on the use of classical image processing methods that can detect varroa only on the basis of the approximation of the body of the mite with a parametrically descriptive curve. In the second part we used a more powerful convolutional neural network. During each of the 500 training epochs we saved the parameters of success. Finally, we fed test data into the trained network and compared its outputs with the expected ones. The thesis contains a theoretical description of algorithms and methods, their use in our detector, and interpretation of results.

[ BSc thesis - CZE ] [ presentation - CZE ]

[2020 - 2021] MSc - Kajetán Poliak : Hand pose estimation during car driving

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Abstract: Hand pose estimation plays a fundamental role in human-computer interaction; moreover, it allows us to analyze human behavior. The problem is nontrivial due to complicated hand variations caused by complex articulations, self-occlusions, and ambiguities in shape, size, and color. We present a comprehensive set of detection and pose estimation methods together with their evaluation. A comparison of depth-map-based and RGB-based pose estimation is provided and applied to real-life data from the driving simulator. Furthermore, we design an annotation tool for RGB-D data, which we used to produce a test dataset.

[ MSc thesis ] [ presentation ]

[2020 - 2021] MSc - Adam Novotný : Satellite data analysis using machine learning methods

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Abstract: The application of machine learning, and of its subset deep learning, has improved and enabled new techniques in the domain of computer vision. The Sentinel-2 mission is specifically designed to provide high-resolution optical satellite imagery, which is suitable for monitoring surface vegetation. In addition to satellite images, we are provided with agricultural field data from 2019 by the State Agricultural Intervention Fund. Combining these two data sources, datasets for field crop classification into thirteen classes are prepared. Both feature-based classifiers and convolutional neural networks are then applied to these pre-processed datasets, and the results of each classifier and its performance are analyzed, discussed, and compared.

[ MSc thesis ] [ presentation ]

[2019 - 2020] MSc - Dominik Vít : Automated visual inspection system for a car engine space

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Abstract: In most automobile factories, the quality inspection process is mainly based on visual control, which is often insufficient and unstable. Artificial intelligence can improve some parts of this quality process. The aim of our project was to develop a proof of concept of such an automated visual inspection system. We focused on the level of cooling liquid. The analysis includes data collection and labeling, pre-processing, feature extraction, and classification.

[ MSc thesis ] [ presentation ]

Digital image processing

[2021 - present] Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Successor of the course - Image processing and recognition I. (below)

course description in Czech

Machine Learning I.

[2021 - present] Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

This course serves as an introductory journey into the realms of machine learning, pattern recognition, and artificial intelligence, blending theoretical knowledge with practical application. It begins with an overview of artificial intelligence, covering essential concepts and terminologies, and progresses to frame decision-making and recognition as optimization challenges. Students will gain hands-on experience with various intuitive classifiers, including k-nearest neighbors (k-NN), linear classifiers, and Support Vector Machines (SVM), understanding their learning mechanisms.

course description in Czech

Machine Learning II.

[2021 - present] Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

This introductory course focuses on deep learning, covering theoretical aspects and network architectures. Topics include Neural Networks, Optimization, Convolutional Neural Networks, Generative Models, RNNs and advanced techniques like Transformers. In labs, students gain hands-on experience with Python, PyTorch, Image Classification, Transfer Learning, Object Detection, Sequence Models, NLP, Advanced Vision and Generative Models. The course concludes with a project presentation and competition.

course description in English

Digital image processing in practice

[2013 - present] Department of Software and Computer Science Education, Faculty of Mathematics and Physics, Charles University

The seminar focuses on digital image processing and pattern recognition. It complements the subject Digital Image Processing taught at the same department. Moreover, experiments and practical applications will be demonstrated here in the MATLAB programming language. The covered topics are: image acquisition and preprocessing (noise reduction, contrast enhancement, deblurring), edge detection, geometric transformations, features for object description, and methods of automatic recognition (classification).

course description in English

Image processing and recognition I.

[2010 - 2021] Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Introductory course in digital image processing and recognition. The main focus is on image digitization, preprocessing (noise reduction, contrast enhancement, blur removal), edge detection, segmentation and geometric transformations.

course description in Czech

Image processing and recognition II.

[2010 - 2021] Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

The course is a direct continuation of the introductory course ROZ1. The main attention is paid to the general theory of feature recognition (classification) and its application to the recognition of 2-D objects in digital images.

course description in Czech

VIGILANT - Vital IntelliGence to Investigate ILlegAl DisiNformaTion

[2022 - 2025] HORIZON.2.3.2 : 101073921

Objectives: Identifying, tracking and investigating online disinformation and other problematic content is an extremely complex problem. Many (European) Police Authorities (PAs) do not have access to any specialised tools or technologies to help them combat disinformation. Some of the better equipped PAs are using unsuitable off-the-shelf products that were designed to enable commercial companies to monitor social media chatter about their brands and products or to track the success of advertising campaigns. Such products are not capable of dealing with the complexities of disinformation, nor do they provide advanced analysis tools and technologies tailored to the PAs' needs. The VIGILANT project solves this problem. It is an integrated platform of advanced disinformation identification and analysis tools and technologies employing state-of-the-art AI methods which were developed as part of several highly successful FP7 and H2020 projects. They will be tailored to PAs' use cases and needs, following an ethical-by-design and user-centric approach. In addition, social and behavioural aspects are also taken into account when designing tools and presenting results. VIGILANT covers disinformation from all major sources (e.g., major social media platforms, fake news websites), in all modalities (text, image, video) and in multiple languages. It is also suitable for investigating hate speech, violent nationalist or separatist movements, radicalisation, extremist groups, incels, lone wolves, and other counter-terrorism threats. The project also provides innovative solutions to leverage the knowledge and experience of other stakeholder organisations and (social) media companies. Finally, the project includes training for PAs in the use of VIGILANT and in conducting disinformation investigations as part of a long-term sustainable training network.
The interdisciplinary consortium includes expertise from social sciences and humanities, ethics, computer science, the commercial sector and four European PAs.

[ project website ] [ project description ]

AISEE - Artificial Intelligence based Search Environment for video/photo

[2022 - 2025] Ministry of the Interior : VJ02010029

Objectives: The project addresses AI data mining in forensic practice. It will offer a software platform for detecting objects of interest based on complex spatio-temporal relationships with object attributes and indexed correlations. It will enable training of AI methods directly in the Police environment using AI semi-automatic and "weakly supervised" learning procedures with end-user involvement. The proposed guidelines will take into account the scarcity of training data. It will include an interface for querying and evaluating the answers to meet the needs of forensic practice. The platform will allow for the evaluation of effectiveness given practice constraints. The development will be in three phases with user feedback. A set of test video sequences according to user scenarios and a research report addressing aspects of the use of AI methods from the perspective of ethics, legal framework, and human rights will be created.

[ project website ] [ project description ]

POSTER PROJECT

[2021 - 2022] MA9 - Wienbibliothek im Rathaus : TU WIEN

Objectives: This project deals with the automated annotation of the poster collection to make the collection searchable. The goal is to provide a prototype that allows searching by text, faces, objects, similarity, and basic image features (e.g. color).

project description

WRITE

[2020 - 2023] FFG Kiras - 879687

Objectives: Law enforcement agencies possess extensive collections of handwritten documents. These include, for example, documents belonging to open cases and reference samples from suspects and prisoners. Yet these collections can only be utilized to a limited extent, since identifying an unknown writer requires handwriting experts to compare all documents manually. WRITE aims to solve this problem by developing an IT-based solution that allows searching for similar handwriting. In this way, the identification of unknown writers by handwriting experts can be expedited, since only a small number of documents with similar handwriting have to be compared manually.

project description

Mobile diagnostic system for reduction of consumption and rational use of antibiotics for primary milk production

[2020 - 2022] Technology Agency of the Czech Republic : FW01010343

Objectives: The main objective of the project is to develop a dairy cow health control system, the ultimate goal of which would be to significantly reduce the use of antibiotics in the treatment and prevention of infectious mammary gland inflammation. Sub-goals:
- Design a system of continuous microbiological diagnostics on dairy farms.
- Monitor the health and economic benefits of consistently applying the above system over a period of time.
- Develop hardware equipment for the optical reading of color colonies of cultured microorganisms.
- Develop a software solution associated with the reader to analyze the displayed microbial colonies.
- Set up mutual links between the reader, its SW, the system administrator and the central database.
- Perform a thorough test of the whole system functionality.

project description

VKG 3.0

[2019 - 2021] Technology Agency of the Czech Republic : TH04010422

Objectives: The aim of the VKG 3.0 project is a new system for the diagnosis of vocal disorders, consisting of a new type of multi-line video camera and data processing software. The camera will capture the vocal cords in a mode that detects their movement in several places at the same time, giving the expert a better idea of the vocal behavior and thus the ability to make the correct diagnosis effectively. The software for the proposed multi-line camera will be developed with an emphasis on data interpretation, and special care will be devoted to the intuitive visualization of captured data. Several features are computed for each scan line. Such a large amount of data would not allow an effective evaluation of the findings, so only the significant data will be fused.

project description

National Competence Center - Cybernetics and Artificial Intelligence

[2018 - 2022] Technology Agency of the Czech Republic : TN01000024

Objectives: The NCK KUI project aims to create a national platform for cybernetics and artificial intelligence which interlinks research and application-oriented centers of robotics and cybernetics for Industry 4.0, Smart Cities, intelligent transport systems and cybersecurity. Connecting innovation leaders will raise the effectiveness of applied research in key areas, such as advanced technology for globally competitive industry, ICT and transportation for the 21st century. NCK KUI is closely linked to the application sector and enables cross-domain collaboration, innovation development and technology transfer.

project description

PROVENANCE - PROviding VErificatioN AssistaNCE for New Content

[2018 - 2022] H2020-EU.2.1.1. : 825227

Objectives: PROVENANCE will develop an intermediary-free solution for digital content verification that gives greater control to users of social media and underpins the dynamics of social sharing in values of trust, openness, and fair participation. Specifically, PROVENANCE will use blockchain to record, in a secure and verifiable manner, multimedia content that is uploaded and registered by content creators or identified for registration by the PROVENANCE Social Network Monitor. The PROVENANCE Verification Layer will apply advanced tools for multimedia analytics (semantic uplift, image forensics, cascade analysis) to record any modifications to content assets and to identify similar pieces of content. A personalised Digital Companion will cater to the information needs of end-users. To help consumers navigate content and develop digital literacy competencies, an iconographic Verification Indicator will contextualise individual pieces of content with relevant information including when the content was registered, by whom, and any subsequent transactions. PROVENANCE will be co-created with diverse representatives of civil society across four distinct use-cases in the social media domain (citizen information seekers, citizen prosumers, factual content creators, and creative content creators). However, the findings will be applicable to any area in which social media and verification are important. The scientific and pragmatic insights gained through PROVENANCE will significantly advance the state of the art in intermediary-free solutions for content verification, understanding of information cascades and information sources on social media, the openness of algorithms, and user control over personal data. In so doing, it will lay the foundation for a new federated social network grounded in trust, openness, and fair participation. In addition, it will support the development of an observatory on information veracity and social media best practice under the ICT28 CSA.

[ project web page ] [ project description ]

ASSISLT - Automated Software System In Speech-Language Therapy

[2018 - 2019] Technology Agency of the Czech Republic : TJ01000181

Objectives: The aim of our project is to create a software system to support speech therapy for adults and children with inborn and acquired motor speech disorders. The planned system focuses on individual treatment using exercises that improve tongue motion and thus articulation. The system will offer an adjustable set of exercises recommended by a therapist, motivation through augmented reality, evaluation of the performance of therapeutic movements, and session archiving. It will allow the therapist to evaluate the schedule and progress of the treatment. Linking the tongue movement to characters in a computer game will motivate children. The basic component of the system is the module that evaluates the tongue motion based on image data from a commercially available camera.

[ project web page ] [ project description ]

Automatic evaluation of videokymographic recordings for early diagnosis and prevention of vocal fold tumors

[2014 - 2017] Technology Agency of the Czech Republic : TA04010877

Objectives: The goal of the project is to develop sophisticated software for videokymography (VKG) which will enable automatic evaluation of medical videokymographic recordings of vibrating vocal folds and help arrive at a correct medical diagnosis. A further goal is to develop a certified method of VKG evaluation to be used in clinical practice. The VKG camera is an existing device which was developed in 1994 in a collaboration between Czech and Dutch colleagues. It has been used for the diagnosis of vocal fold vibration problems caused by various voice disorders, the most serious being vocal fold cancer. Currently, evaluation of VKG recordings can be done only by a highly experienced clinician. The evaluation is complicated and time-consuming, and therefore the method has not been widely used. The addition of software for automatic evaluation will allow wider, more efficient and less time-demanding application of the method in clinical practice and an early and rather inexpensive diagnosis of tumorous states. It will allow detecting vocal fold cancer at an early stage, when the treatment can be done noninvasively or by a simple surgical intervention so that the quality of life is preserved. The project is based on the collaboration of four partners: a clinical centre specialized in the diagnosis and treatment of voice disorders (Voice Centre Prague, Medical Healthcom, Ltd), the team of the inventor of the method of videokymography (dr. Svec, Department of Biophysics, Palacky University Olomouc), a research institute specialized in digital image analysis (Institute of Information Theory and Automation, Academy of Sciences of the Czech Republic) and a company experienced in diagnostic product development (STARMANS electronics).

project description

Capsule endoscopy in diagnostics of small bowel mucosal injury induced by nonsteroidal anti-inflammatory drugs

[2012 - 2015] Technology Agency of the Czech Republic : NT13532

Objectives: This prospective study focuses on the identification of endoscopic, clinical and laboratory markers of small bowel injury in long-term NSAID users. Patients with rheumatoid arthritis or osteoarthritis and healthy volunteers will be included in the study. Defining the normal findings allows identification of genuine NSAID-induced injury. A detailed questionnaire concentrating on clinical signs will be completed by all participants. Laboratory tests and capsule endoscopy will be an integral part of the study. The endoscopy findings will be scored according to their severity.

project description

Tools for imaging device identification, authentication, and image reconstruction

[2010 - 2013] Ministry of the Interior : VG20102013064

Objectives: A software application consisting of three modules, for device identification, authentication, and image reconstruction, respectively. The project output will enable identification of the imaging device (digital cameras, camcorders) and will make it possible to rule out intentional post-processing changes to images or video. Finally, it will provide tools for improving the quality of analyzed image and video data using digital reconstruction methods.

project description

Development of methods for image analysis of photographs in digital and analog form for data authentication

[2010 - 2012] Ministry of the Interior : VF20102012010

Objectives: Proposal of new software methods for image data authentication in still image record.

project description

novozamsky@utia.cas.cz skype: newcastlea +420-777-577-375

Department of Image Processing
Institute of Information Theory and Automation
The Czech Academy of Sciences
Pod Vodárenskou věží 4, Prague, Czechia

Education : .

Research : .

Image Processing & Computer Vision

Medical Imaging

Image Forensics

Biological imaging

Deep Learning for Computer Vision

Publications : .

Real-Time Wheel Detection and Rim Classification in Automotive Production

NERD: Neural Field-Based Demosaicking

Exploration of the Vienna City Library Poster Collection using Computer Vision Approaches

Monitoring of Varroa Infestation Rate in Beehives: A Simple AI Approach

Videokymogram Analyzer Tool: Human–computer comparison

Extended IMD2020: a large‐scale annotated dataset tailored for detecting manipulated images

Automated Object Labeling For CNN-Based Image Segmentation

IMD2020: A Large-Scale Annotated Dataset Tailored for Detecting Manipulated Images

Detection of Copy-move Image Modification Using JPEG Compression Model

Automatic blood detection in capsule endoscopy video

PIZZARO: Forensic analysis and restoration of image and video data

Teaching : .

[2023 - 2024] Research Task - Michal Průšek : Biological data analysis and evaluation

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

[2023 - 2024] MSc - Soňa Drocárová : Road Surface Condition Monitoring System

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

[2022 - 2024] BSc - Anel Mergaliyeva : Smart Robotic Arm Using Computer Vision

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

[2022 - 2024] MSc - Antonín Čech : Estimation and Tracking of the Human Pose with a Single RGB

Department of Software Engineering, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

[2022 - 2023] BSc - Michal Průšek : Gesture-controlled drone

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

[2021 - 2022] Research Task - Anna Gruberová : Optical Character Recognition on Scanned Historical Posters Using the State-of-the-Art Methods

Department of Software Engineering, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

[2021 - 2022] Research Task - Soňa Drocárová : Comparison of methods for unconstrained face detection

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

[2021 - 2022] MSc - Adéla Kostelecká : Content-Based Image Retrieval: from Primitive to Advanced Techniques

Department of Algebra, Faculty of Mathematics and Physics, Charles University

[2020 - 2022] MSc - Roman Staněk : System for automatic size and type control of cars rims

Department of Software and Computer Science Education, Faculty of Mathematics and Physics, Charles University

[2020 - 2021] BSc - Markéta Machalová : Detection of Varroa destructor using computer vision

Department of Algebra, Faculty of Mathematics and Physics, Charles University

[2020 - 2021] MSc - Kajetán Poliak : Hand pose estimation during car driving

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

[2020 - 2021] MSc - Adam Novotný : Satellite data analysis using machine learning methods

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

[2019 - 2020] MSc - Dominik Vít : Automated visual inspection system for a car engine space

Department of Mathematics, Faculty of Nuclear Science and Physical Engineering, Czech Technical University in Prague

Grants I worked on : .

Contact : .