Commit 8321b4a

update
1 parent ed3adbf commit 8321b4a

42 files changed, +1081 -228 lines changed

notes/ai/README.md

Lines changed: 18 additions & 0 deletions
@@ -10,6 +10,24 @@ title: AI

 - Rule-based systems, expert systems, machine learning

+---
+
+- fundamentals
+  - [Machine Learning](./ml/README.md)
+  - Deep Learning
+- models
+  - [LLM](./llm/README.md)
+  - [GPT](./gpt/README.md)
+  - [GAN](./gan/README.md)
+  - Diffusion
+- domains
+  - [OCR](./ocr/README.md)
+  - [NLP](./nlp/README.md)
+  - [TTS](./tts/README.md)
+  - [ASR](./asr/README.md)
+- services
+  - OpenAI
+
 ## Explanation

 - Common subfields

notes/ai/ai-faq.md

Lines changed: 7 additions & 0 deletions
@@ -29,3 +29,10 @@ tags:
 - RAG - retrieval-augmented generation
 - References
   - https://research.ibm.com/blog/retrieval-augmented-generation-RAG
+
+## STT vs ASR
+
+- STT: Speech to Text
+  - Describes the product feature
+- ASR: Automatic Speech Recognition
+  - Names the underlying technology

notes/ai/ai-glossary.md

Lines changed: 2 additions & 0 deletions
@@ -40,6 +40,8 @@ tags:
 | en               | cn       |
 | ---------------- | -------- |
 | Stable Diffusion | 稳定扩散 |
+| Speech Synthesis | 语音合成 |
+| Voice Synthesis  | 语音合成 |

 ## LLM parameters

notes/ai/asr.md renamed to notes/ai/asr/README.md

Lines changed: 1 addition & 0 deletions
@@ -5,6 +5,7 @@ tags:

 # ASR

+- ASR - Automatic Speech Recognition
 - [FunASR](./funasr.md)
 - [Whisper](./whisper.md)
 - Kaldi

notes/ai/funasr.md renamed to notes/ai/asr/funasr.md

Lines changed: 23 additions & 23 deletions
@@ -36,30 +36,30 @@ bash run_server.sh \
## Protocol

Whitespace-only change: the removed lines 39-47 and 49-62 and their added replacements differ only in whitespace.

```ts
interface OfflineRequestMessage {
  mode: 'offline';
  wav_name: string;
  wav_format: string | 'pcm' | 'mp3' | 'mp4';
  is_speaking: boolean; // false -> end of an utterance, e.g. a VAD cut point or the end of a wav
  audio_fs?: number; // PCM sample rate
  hotwords?: Record<string, number>; // hotwords
  itn?: boolean; // defaults to true
}

interface ResponseMessage {
  mode: 'offline';
  wav_name: string;
  text: string;
  is_final: boolean;
  timestamp?: number[][]; // timestamps, "[[100,200], [200,500]]" (ms)
  stamp_sents?: {
    text_seg: string; // 正 是 因 为
    punc: string; // ,
    start: number;
    end: number;
    ts_list: number[][]; // [[430,670],[670,810],[810,1030],[1030,1130]]
  }[];
}
```

- mode
File renamed without changes.
File renamed without changes.
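
The `OfflineRequestMessage` and `ResponseMessage` shapes from the funasr.md Protocol hunk can be exercised without a running server. The following is a minimal sketch, not FunASR client code: `buildOfflineRequest` and `formatSentences` are hypothetical helpers introduced here for illustration, the transport (for example a WebSocket) is left out, and stripping spaces from `text_seg` assumes Chinese text as in the sample comment.

```typescript
// Shapes copied from the Protocol section of funasr.md.
interface OfflineRequestMessage {
  mode: 'offline';
  wav_name: string;
  wav_format: string;
  is_speaking: boolean;
  audio_fs?: number;
  hotwords?: Record<string, number>;
  itn?: boolean;
}

interface StampSentence {
  text_seg: string;
  punc: string;
  start: number;
  end: number;
  ts_list: number[][];
}

interface ResponseMessage {
  mode: 'offline';
  wav_name: string;
  text: string;
  is_final: boolean;
  timestamp?: number[][];
  stamp_sents?: StampSentence[];
}

// Hypothetical helper: serializes the configuration message for a PCM stream.
function buildOfflineRequest(wavName: string, sampleRate: number): string {
  const msg: OfflineRequestMessage = {
    mode: 'offline',
    wav_name: wavName,
    wav_format: 'pcm',
    is_speaking: true, // audio is still being sent
    audio_fs: sampleRate,
    itn: true,
  };
  return JSON.stringify(msg);
}

// Hypothetical helper: renders one line per entry in stamp_sents.
function formatSentences(res: ResponseMessage): string[] {
  return (res.stamp_sents || []).map((s) => {
    // Assumption: text_seg holds space-separated characters (Chinese text).
    const text = s.text_seg.replace(/\s+/g, '') + s.punc;
    return `[${s.start}-${s.end}ms] ${text}`;
  });
}

// Usage with the sample values from the interface comments.
const sampleResponse: ResponseMessage = {
  mode: 'offline',
  wav_name: 'demo',
  text: '正是因为,',
  is_final: true,
  stamp_sents: [
    {
      text_seg: '正 是 因 为',
      punc: ',',
      start: 430,
      end: 1130,
      ts_list: [[430, 670], [670, 810], [810, 1030], [1030, 1130]],
    },
  ],
};
console.log(buildOfflineRequest('demo', 16000));
console.log(formatSentences(sampleResponse).join('\n'));
```

Keeping message construction and response formatting as pure functions lets the typing be checked separately from whatever connection code sends the audio.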

notes/ai/llm/ollama.md

Lines changed: 5 additions & 2 deletions
@@ -29,8 +29,11 @@ title: ollama
 ```bash
 brew install ollama # macOS brew

-OLLAMA_FLASH_ATTENTION=1 ollama serve # start the server
-ollama run mistral # run a model
+# Start the server
+# OLLAMA_KV_CACHE_TYPE: Ollama 0.5+
+OLLAMA_FLASH_ATTENTION=1 OLLAMA_KV_CACHE_TYPE=q4_0 ollama serve
+
+ollama run mistral # run a model
 ollama list

 # https://hub.docker.com/r/ollama/ollama

notes/ai/ml/ml-awesome.md

Lines changed: 3 additions & 0 deletions
@@ -138,6 +138,9 @@ tags:
 - XCiT
 - DINO - Self-Supervised Vision Transformers
   - PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
+- GOT - Generic Object Tracking
+  - [GOT-10k](http://got-10k.aitestunion.com/)
+  - [GOT-10k: A Large High-Diversity Benchmark for Generic Object Tracking in the Wild](https://arxiv.org/abs/1810.11981)
 - audio/music/speech/voice/tts
   - [microsoft/muzic](https://github.com/microsoft/muzic)
   - [yl4579/StyleTTS2](https://github.com/yl4579/StyleTTS2)

notes/ai/ml/paddle.md

Lines changed: 2 additions & 0 deletions
@@ -9,6 +9,7 @@ tags:
 - Apache-2.0
 - by Baidu
 - 飞桨 - PADDLE -> PArallel Distributed Deep LEarning
+- A general-purpose framework, but Paddle is strongest at Chinese OCR and NLP
 - References
   - https://www.paddlepaddle.org.cn/
   - [PaddlePaddle/PaddleHub](https://github.com/PaddlePaddle/PaddleHub)
@@ -22,6 +23,7 @@ pip install paddlepaddle
 pip install paddlepaddle-gpu

 # Docker
+# Baidu registry mirror: registry.baidubce.com/paddlepaddle/paddle:3.0.0b1
 docker run --rm -it -v $PWD:/host --entrypoint /host --name paddle paddlepaddle/paddle /bin/bash
 ```
