8/6/2023 0 Comments Simple comic strip on laptop![]() In this study, we conducted a systematic review of 33 papers to get a holistic understanding of existing approaches and to suggest a research road map given identified gaps. Thus, we avoid the process of extraction features a priori which will be performed automatically, taking into consideration the different characteristics of the documents.Ī number of studies have been conducted to improve the accessibility of images using touchscreen devices for screen reader users. Moreover, no binarization step of the processed document is done in order to avoid losing data that may influence the accuracy of the two frameworks. The two approaches are layout segmentation free and the generalized Haar-like filters are applied globally on the image. Describing and detailing the use of such features throughout this thesis, we offer the researches of document image analysis field a new line of research that has to be more explored in future. These two approaches are based on applying generalized Haar-like filters globally on each document image whatever its type. These characteristics guide us to introduce two multi scale approaches for two different document analysis tasks which are text extraction from comics and word spotting in manuscript document. Indeed, they were inspired by several characteristics of human vision such as the Preattentive processing. The two approaches are based on human perception characteristics. The second one points out a learning free segmentation free word spotting framework based on the query-by-string problem for manuscript documents. The first one disposes a technique for text and graphic separation in comics. The presented thesis follows two directions. The approach is compared with some existing methods found in the literature and results are presented. In this paper we propose to rely on this particularity of comic books to automatically extract frame and text using a connected-component labeling analysis. Despite of the differences, drawings have a common characteristic because of design process: they are all surrounded by a black line. In fact, the page structure depends on the author which is why many different structures and drawings exist. Only frame and speech balloon extraction have been experimented in the case of a simple page structure. Few studies has been done in this direction. Nowadays, digiti-sation allows to search directly from content instead of metadata only (e.g. Comic books represents an important heritage in many countries. Nous comparerons notre méthode avec des outils de la littérature et discuterons des résultats. Dans cet article, nous proposons de nous appuyer sur cette particularité des bandes dessinées pour extraire automatiquement les cases et le texte avec une méthode basée sur la classification de composantes connexes. Malgré cette diversité, les dessins ont une particularité commune de part leurs méthodes de conception : ils sont constitués ou entourés d'un trait noir. En effet, la structure des pages est propre à chaque auteur, ce qui engendre une très grande diversité de des-sins. Seule l'extraction des cases et des bulles de dialogues a été étudiée et ce, pour des structures de pages relativement simples. La numérisation en masse offre l'opportunité d'effectuer des recherches sur le contenu des albums et pas uniquement sur des métadonnées associées (e.g. Les bandes dessinées représentent un patrimoine culturel important dans de nombreux pays. Our approach is compared with other methods find in the literature. We propose, in this paper, a method based on region growing and mathematical morphology to extract automatically the panels of a comic page and a method to detect speech balloons. Moreover, unlike newspapers, the text layout in speech balloons can be irregular. However the text is usually embedded among graphic elements. Full text indexing is only possible if the text can be extracted. Speech balloons are other important elements of comics. ![]() In some situations, the panel extraction can become a real challenge. Moreover, authors often draw extended contents (speech balloon or comic art) that overlap two panels or more. In practice, the configuration of the page, the size and the shape of the panels can be different from one page to the next. At first glance, the structure of a comic page may appear easy to determine. However, few researches have been done in order to analyse the content of comics such as panels, speech balloons or characters. Comic books represent an important cultural heritage in many countries.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |