Research and Implementation of Video Resource Capture Based on Heritrix

Abstract: Teaching videos are an important part of a teaching resource library, and adding video resources is one of the key tasks of the system platform. At present, many teaching resource libraries add video resources manually, which is inefficient and creates a heavy workload. By introducing a web crawler and using the extension mechanism of Heritrix, custom modules can be built so that the crawler automatically captures course video resources from the web. Optimizing its crawling algorithm can further improve the efficiency and accuracy of video capture for the resource library.
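The abstract does not describe how the custom module is implemented. As a rough illustration of the kind of check such a module might perform, the following minimal sketch decides whether a discovered URL looks like a video file. It is plain Java and deliberately does not call any Heritrix API. The class name `VideoUrlFilter`, the method `isVideoUrl`, and the extension list are all hypothetical assumptions and are not taken from the paper.

```java
import java.net.URI;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper, not part of Heritrix and not from the paper.
final class VideoUrlFilter {
    // Assumed list of video file extensions; adjust for the target sites.
    private static final Set<String> VIDEO_EXTENSIONS =
            Set.of("mp4", "flv", "avi", "wmv", "rmvb", "mkv");

    private VideoUrlFilter() {}

    // Returns true when the URL path ends in one of the assumed video extensions.
    static boolean isVideoUrl(String url) {
        if (url == null) {
            return false;
        }
        String path;
        try {
            // getPath() excludes the query string and fragment.
            path = URI.create(url.trim()).getPath();
        } catch (IllegalArgumentException e) {
            return false; // malformed URL
        }
        if (path == null) {
            return false; // opaque URI such as mailto:
        }
        int dot = path.lastIndexOf('.');
        int slash = path.lastIndexOf('/');
        // The dot must belong to the last path segment and be followed by text.
        if (dot < 0 || dot < slash || dot == path.length() - 1) {
            return false;
        }
        String ext = path.substring(dot + 1).toLowerCase(Locale.ROOT);
        return VIDEO_EXTENSIONS.contains(ext);
    }
}
```

In a real crawl, a check like this would presumably sit inside a custom Heritrix module that decides which links to fetch or store; that wiring is not shown here because the abstract gives no details about it. Matching on file extension alone is a simplification, since a video served from a URL without a recognizable extension would be missed.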

     
