Multi-disease Recognition Method for Fundus Images Based on Text Enhancement

doi:10.12146/j.issn.2095-3135.20240422001

Home > Archive>Volume 14, Issue 1, 2025 >78-90. DOI:10.12146/j.issn.2095-3135.20240422001

Multi-disease Recognition Method for Fundus Images Based on Text Enhancement
DOI:
                        10.12146/j.issn.2095-3135.20240422001
                    
CSTR:
                        
Author:
                        
Affiliation:
Clc Number:TP391,R77
Fund Project:This work is supported by Shenzhen Science and Technology Innovation Commission (JSGG20220831105002004)

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

In this work, a visual language model is introduced in ophthalmic image disease recognition. And a multi-disease recognition algorithm based on a pre-trained contrasting language-images model is proposed. First, a multi-labeled fundus image dataset MDFCD8 containing 8 categories is constructed based on several publicly available fundus image datasets. Then, the generative artificial intelligence GPT-4 (Generative Pre-trained Transformer 4) is utilized to generate expert knowledge describing the fine-grained pathological features of fundus images, which solves the problem of the lack of text labels in fundus image datasets. The paper calculates the average precision (AP), F1 score, and area under the receiver operating characteristic curve (AUC), and takes the mean value of the three as the final performance evaluation index. The experimental results showed that, the method proposed in this paper outperforms the traditional convolutional neural network and Transformer network by 4.8% and 3.2%, respectively. This study also conducted ablation experiments on each module to validate the effectiveness of the method, demonstrating the potential application of visual language modeling in the field of auxiliary diagnosis of ophthalmic diseases.

Reference

Cited by

Get Citation

XIONG Shaokui, CHEN Shifeng. Multi-disease Recognition Method for Fundus Images Based on Text Enhancement[J]. Journal of Integration Technology,2025,14(1):78-90

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:April 22,2024
Revised:April 22,2024
Adopted:
Online: June 11,2024
Published:

Home

About Journal

Editorial Team

Author Center

Peer Review

Reader Center

Ethics

Contact us

中文

Get Citation

Share

Article Metrics

History

Article QR Code