Skip to content

Official implementation of projects/papers from the Bilibili account "具身人机", focusing on embodied intelligence, human-computer interaction, and robotics.

Notifications You must be signed in to change notification settings

Tigerdwgth/JushenRenji

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pdf-to-video-app/pdf-to-video-app/README.md

Paper2Video App

该项目是一个将PDF文件转换为讲解视频的应用程序。用户可以输入一个PDF文件,应用程序将提取其中的文本和图片,并生成一个引人注目的讲解视频。

项目结构

├── cache
├── config.yaml
├── font
├── npm-debug.log
├── output
├── package.json
├── pic
├── README_EN.md
├── README.md
├── requirements.txt
└── src
    ├── auto_upload_bilibili.py
    ├── code_interpreter.py
    ├── config.py
    ├── distribution
    ├── extract_images_from_pdf.py
    ├── generate_cover.py
    ├── get_arxiv_latest.py
    ├── get_website_data.py
    ├── gui.py
    ├── llm_agent.py
    ├── main.py
    ├── pdf_processor.py
    ├── __pycache__
    ├── types
    ├── utils
    └── video_creator.py

功能

  1. PDF处理:从PDF文件中提取文本和图片。
  2. 视频创建:将提取的内容合成视频。
  3. 摘要和标题生成:生成引人注目的摘要和标题,以吸引观众。

使用说明

  1. 确保已安装所有依赖项。可以通过以下命令安装: 安装tesseract-ocr,参照 https://tesseract-ocr.github.io/tessdoc/Installation.html

    conda create -n paperagent python=3.10
    conda activate paperagent
    pip install -r requirements.txt
    pip3 install torch torchvision torchaudio #linux
    pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu126 #win
    pip install -e .
    
    

自动上传更新功能

安装小红书MCP服务

conda install conda-forge::nodejs

Run from your project's root directory

npm init playwright@latest

Or create a new project

npm init playwright@latest new-project

请修改 anaconda3\envs\paperagent\lib\site-packages\deepdoctection\extern\tessocr.py

   # 263行 前加入
   if not results:
      return all_results
   #181行 删掉文件后缀名
   with open(tmp_name , "rb") as output_file:
  1. 运行应用程序:

    python src/main.py --filename "{papername}"
    

贡献

欢迎任何形式的贡献!请提交问题或拉取请求。

许可证

该项目遵循MIT许可证。

About

Official implementation of projects/papers from the Bilibili account "具身人机", focusing on embodied intelligence, human-computer interaction, and robotics.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages