
Nougat: Combining Optical Neural Networks for Intelligent Parsing of Academic PDF Documents and Unlocking the Value of Academic Paper PDFs

Posted 2 years ago. Author: 汀、人工智能. Category: Toy Blog.

This is the official repository for Nougat, an academic document PDF parser that understands LaTeX math and tables.

Project page: https://facebookresearch.github.io/nougat/

1. Installation

From pip:

pip install nougat-ocr

From repository:

pip install git+https://github.com/facebookresearch/nougat

Note for Windows: if you want to use a GPU, make sure you first install the correct PyTorch version by following the official PyTorch installation instructions.

There are extra dependencies if you want to call the model from an API or generate a dataset.
Install them via

pip install "nougat-ocr[api]" or pip install "nougat-ocr[dataset]"

1.2 Getting predictions for a PDF

1.2.1 CLI

To get predictions for a PDF run

$ nougat path/to/file.pdf -o output_directory

A path to a directory, or to a file in which each line is the path to a PDF, can also be passed as a positional argument

$ nougat path/to/directory -o output_directory

usage: nougat [-h] [--batchsize BATCHSIZE] [--checkpoint CHECKPOINT] [--model MODEL] [--out OUT]
              [--recompute] [--markdown] [--no-skipping] pdf [pdf ...]

positional arguments:
  pdf                   PDF(s) to process.

options:
  -h, --help            show this help message and exit
  --batchsize BATCHSIZE, -b BATCHSIZE
                        Batch size to use.
  --checkpoint CHECKPOINT, -c CHECKPOINT
                        Path to checkpoint directory.
  --model MODEL_TAG, -m MODEL_TAG
                        Model tag to use.
  --out OUT, -o OUT     Output directory.
  --recompute           Recompute already computed PDF, discarding previous predictions.
  --full-precision      Use float32 instead of bfloat16. Can speed up CPU conversion for some setups.
  --no-markdown         Do not add postprocessing step for markdown compatibility.
  --markdown            Add postprocessing step for markdown compatibility (default).
  --no-skipping         Don't apply failure detection heuristic.
  --pages PAGES, -p PAGES
                        Provide page numbers like '1-4,7' for pages 1 through 4 and page 7. Only works for single PDFs.

The default model tag is 0.1.0-small. If you want to use the base model, use 0.1.0-base.

$ nougat path/to/file.pdf -o output_directory -m 0.1.0-base
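
When the CLI is driven from a script, it can help to assemble the argument list in one place. The following is a minimal sketch that uses only the flags listed in the help output above; the helper names build_nougat_command and run_nougat are hypothetical and not part of the nougat package.

```python
import subprocess
from typing import List, Optional


def build_nougat_command(
    pdf: str,
    out_dir: str,
    model: Optional[str] = None,
    pages: Optional[str] = None,
    no_skipping: bool = False,
) -> List[str]:
    """Assemble an argv list for the nougat CLI (hypothetical helper)."""
    cmd = ["nougat", pdf, "-o", out_dir]
    if model is not None:
        cmd += ["-m", model]          # e.g. "0.1.0-base"
    if pages is not None:
        cmd += ["-p", pages]          # e.g. "1-4,7"; single PDFs only
    if no_skipping:
        cmd.append("--no-skipping")   # disable the failure detection heuristic
    return cmd


def run_nougat(pdf: str, out_dir: str, **options) -> None:
    """Run the CLI and raise CalledProcessError on a non-zero exit code."""
    subprocess.run(build_nougat_command(pdf, out_dir, **options), check=True)
```

For example, run_nougat("path/to/file.pdf", "output_directory", model="0.1.0-base") would issue the same command as the line above, assuming the nougat executable is on the PATH.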

In the output directory every PDF will be saved as a .mmd file, the lightweight markup language, mostly compatible with Mathpix Markdown (we make use of the LaTeX tables).

Note: On some devices the failure detection heuristic is not working properly. If you experience a lot of [MISSING_PAGE] responses, try to run with the --no-skipping flag. Related: #11, #67
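
To judge whether a run is affected, one option is to count the missing-page markers in the generated .mmd files. This is only a sketch: the function name count_missing_pages is hypothetical, and it assumes that every marker starts with the text "[MISSING_PAGE", which may not hold for every version.

```python
from pathlib import Path
from typing import Dict

# Assumption: every missing-page marker in the output starts with this prefix.
MISSING_MARKER = "[MISSING_PAGE"


def count_missing_pages(out_dir: str) -> Dict[str, int]:
    """Map each .mmd file name in out_dir to its number of marker occurrences."""
    counts: Dict[str, int] = {}
    for mmd in sorted(Path(out_dir).glob("*.mmd")):
        text = mmd.read_text(encoding="utf-8")
        counts[mmd.name] = text.count(MISSING_MARKER)
    return counts
```

Files with a high count relative to their page count are candidates for a rerun with --recompute and --no-skipping.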

1.2.2 API

With the extra dependencies installed, you can use app.py to start an API. Call

$ nougat_api

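Send a POST request to http://127.0.0.1:8503/predict/ to get a prediction for a PDF file. The endpoint also accepts the parameters start and stop to limit the computation to a selected range of page numbers (boundaries included).

The response is a string with the markdown text of the document.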

curl -X 'POST' \
  'http://127.0.0.1:8503/predict/' \
  -H 'accept: application/json' \
  -H 'Content-Type: multipart/form-data' \
  -F 'file=@<PDFFILE.pdf>;type=application/pdf'

To limit the conversion to pages 1 to 5, use the start/stop parameters in the request URL: http://127.0.0.1:8503/predict/?start=1&stop=5
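
The same request can be sent from Python. The sketch below mirrors the curl call and assumes the third-party requests package is installed, that the server runs at the address shown above, and that the response body is a JSON-encoded string (as the accept header suggests). The function names build_predict_url and predict are hypothetical.

```python
import os
from typing import Optional
from urllib.parse import urlencode

BASE_URL = "http://127.0.0.1:8503/predict/"


def build_predict_url(
    start: Optional[int] = None,
    stop: Optional[int] = None,
    base_url: str = BASE_URL,
) -> str:
    """Append the optional start/stop page limits as query parameters."""
    params = {}
    if start is not None:
        params["start"] = start
    if stop is not None:
        params["stop"] = stop
    return f"{base_url}?{urlencode(params)}" if params else base_url


def predict(pdf_path: str, start: Optional[int] = None, stop: Optional[int] = None) -> str:
    """POST a PDF to the running API and return the document markup."""
    import requests  # third-party; imported lazily so the URL helper works without it

    with open(pdf_path, "rb") as fh:
        response = requests.post(
            build_predict_url(start, stop),
            files={"file": (os.path.basename(pdf_path), fh, "application/pdf")},
            headers={"accept": "application/json"},
            timeout=600,  # arbitrary choice; conversion can be slow
        )
    response.raise_for_status()
    return response.json()  # assumption: body is a JSON-encoded string
```

Calling predict("paper.pdf", start=1, stop=5) would then request pages 1 through 5 of a local file named paper.pdf.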

2. Dataset

2.1 Generating a dataset

To generate a dataset you need

A directory containing the PDFs
A directory containing the .html files (processed .tex files by LaTeXML) with the same folder structure
A binary file of pdffigures2 and a corresponding environment variable export PDFFIGURES_PATH="/path/to/binary.jar"

Next run

python -m nougat.dataset.split_htmls_to_pages --html path/html/root --pdfs path/pdf/root --out path/paired/output --figure path/pdffigures/outputs

Additional arguments include

Argument	Description
`--recompute`	recompute all splits
`--markdown MARKDOWN`	Markdown output dir
`--workers WORKERS`	How many processes to use
`--dpi DPI`	What resolution the pages will be saved at
`--timeout TIMEOUT`	max time per paper in seconds
`--tesseract`	Tesseract OCR prediction for each page

Finally create a jsonl file that contains all the image paths, markdown text and meta information.

python -m nougat.dataset.create_index --dir path/paired/output --out index.jsonl

For each jsonl file you also need to generate a seek map for faster data loading:

python -m nougat.dataset.gen_seek file.jsonl
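
To illustrate why a seek map speeds up loading: if the byte offset of every line is known, a single record can be read without scanning the file from the start. The sketch below shows that general idea only. The actual format written by nougat.dataset.gen_seek is not described here and may differ, and the names build_seek_map and read_record are hypothetical.

```python
import json
from typing import List


def build_seek_map(jsonl_path: str) -> List[int]:
    """Return the byte offset at which each line of the file starts."""
    offsets: List[int] = []
    with open(jsonl_path, "rb") as fh:
        while True:
            pos = fh.tell()
            line = fh.readline()
            if not line:  # end of file
                break
            offsets.append(pos)
    return offsets


def read_record(jsonl_path: str, offsets: List[int], index: int) -> dict:
    """Jump straight to record `index` and parse only that line."""
    with open(jsonl_path, "rb") as fh:
        fh.seek(offsets[index])
        return json.loads(fh.readline().decode("utf-8"))
```

With the offsets precomputed once, a data loader can fetch sample i in constant time instead of reading i lines first.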

The resulting directory structure can look as follows:

root/
├── images
├── train.jsonl
├── train.seek.map
├── test.jsonl
├── test.seek.map
├── validation.jsonl
└── validation.seek.map

Note that the .mmd and .json files in path/paired/output (here images) are no longer required.
Removing them can be useful when pushing to an S3 bucket, since it halves the number of files.

2.2 Training

To train or fine tune a Nougat model, run

python train.py --config config/train_nougat.yaml

2.3 Evaluation

Run

python test.py --checkpoint path/to/checkpoint --dataset path/to/test.jsonl --save_path path/to/results.json

To get the results for the different text modalities, run

python -m nougat.metrics path/to/results.json

2.4 FAQ

Why am I only getting [MISSING_PAGE]?

Nougat was trained on scientific papers found on arXiv and PMC. Is the document you’re processing similar to that?
What language is the document in? Nougat works best with English papers, other Latin-based languages might work. Chinese, Russian, Japanese etc. will not work.
If these requirements are fulfilled it might be because of false positives in the failure detection, when computing on CPU or older GPUs (#11). Try passing the --no-skipping flag for now.
Where can I download the model checkpoint from?

They are uploaded on GitHub in the release section. You can also download them during the first execution of the program. Choose the preferred model by passing --model 0.1.0-{base,small}

Reference:
https://github.com/facebookresearch/nougat
