LinkBERT: A Knowledgeable Language Model Pretrained with Document Links

This repo provides the model, code & data of our paper: LinkBERT: Pretraining Language Models with Document Links (ACL 2022). @InProceedings{yasunaga2022linkbert, author = {Michihiro Yasunaga and Jure Leskovec and Percy Liang}, title = {LinkBERT: Pretraining Language Models with Document Links}, year = {2022}, booktitle = {Association for Computational Linguistics (ACL)}, } Overview LinkBERT is a new pretrained language model (improvement of BERT) that captures document links such as hyperlinks and citation    

Read more

Automatic documentation from sources for MkDocs

Automatic documentation from sources, for MkDocs. Features Language agnostic: just like mkdocs, mkdocstrings is written in Python but is language-agnostic. It means you can use it for any language, as long as you implement a handler for it. Currently, we only have a Python handler. Maybe you’d like to contribute another one?? Multiple themes support: each handler can offer multiple themes. Currently, we offer the⭐Material theme⭐as well as basic support for the ReadTheDocs theme for the Python handler. Cross-references to […]

Read more

Run black on python code blocks in documentation files

Run black on python code blocks in documentation files. install pip install blacken-docs usage blacken-docs provides a single executable (blacken-docs) which will modify .rst / .md / .tex files in place. It currently supports the following black options: -l / –line-length -t / –target-version -S / –skip-string-normalization Following additional parameters can be used: blacken-docs will format code in the following block types: (markdown) “`python def hello(): print(“hello world”) “` (rst)    

Read more

Main repository for the Sphinx documentation builder

Sphinx is a tool that makes it easy to create intelligent and beautiful documentation for Python projects (or other documents consisting of multiple reStructuredText sources), written by Georg Brandl. It was originally created for the new Python documentation, and has excellent facilities for Python project documentation, but C/C++ is supported as well, and more languages are planned. Sphinx uses reStructuredText as its    

Read more

A document format conversion service based on Pandoc

reformed Document format conversion service based on Pandoc. Usage The API specification for the Reformed server is as follows: GET /api/v1/formats: Lists available input and output formats for documents Response { “input”: { “commonmark”: { “mime”: “text/markdown”, “ext”: “md”, “detail”: “CommonMark Markdown” }, “docx”: { “mime”: “application/vnd.openxmlformats-officedocument.wordprocessingml.document”, “ext”: “docx”, “detail”: “Word docx” }, // … }, “output”: { “commonmark”: { “mime”: “text/markdown”, “ext”: “md”, “detail”: “CommonMark Markdown” }, “docx”: { “mime”: “application/vnd.openxmlformats-officedocument.wordprocessingml.document”, “ext”: “docx”, “detail”: “Word docx” }, // … […]

Read more

Create Python API documentation in Markdown format

Pydoc-Markdown Pydoc-Markdown is a tool and library to create Python API documentation in Markdown format based on lib2to3, allowing it to parse your Python code without executing it. Pydoc-Markdown requires Python 3.7 or newer, however the code that you want to generate API documentation for can be for any Python version. Features Understands multiple doc styles (Sphinx, Google, Pydoc-Markdown) Supports assignment docstrings (#: block before or string literal after the statement) Links references to other documented API objects [WIP] [MkDocs][], […]

Read more

Seamlessly integrate pydantic models in your Sphinx documentation

autodoc_pydantic Seamlessly integrate pydantic models in your Sphinx documentation. You love pydantic ❤ and you want todocument your models and configuration settings with sphinx? Perfect, let’s go. But wait, sphinx’ autodocdoes not integrate too well with pydantic models 😕. Don’t worry – just pip install autodoc_pydantic ☺. Features 💬 provides default values, alias and constraints for model fields 🔗 adds hyperlinks between validators and corresponding fields 📃 includes collapsable model json schema 🏄 natively integrates with autodoc and autosummary extensions […]

Read more

A simple python library for generating documentation from docstrings

inkpot a small simple library for generating documentation from docstrings Installation pip install inkpot Usage singel file python3 -m inkpot myfile.py or directory python3 -m inkpot myproject/ output to a file (also works with directories) python3 -m inkpot myfile.py > doc.md python3 -m inkpot myproject/ > doc.md Currently this returns a markdown table. More functionality and a better format will be added. Example Python file ex/add.py def add(a,b): “”” add to objects “”” return a + b python3 -m inkpot […]

Read more

Multiple-level labeling on document images and can generate in multiple languages

SDL: Synthetic Document Layout dataset SDL is the project that synthesizes document images. It facilitates multiple-level labeling on document images and can generate in multiple languages. Sample image Structure of data Quick start python flexible_layout.py –config_file configs/page.yaml Instruction to run data generation Go to instruction Visualization of the result python data_manipulation/visualize.py Vietnamese 300000 images link: Release soon Paper https://arxiv.org/abs/2106.15117 GitHub https://github.com/tson1997/SDL-Document-Image-Generation    

Read more

A document organizer with tags and full-text-search in sqlite3

bibliothecula document organizer with tags and full-text-search, in a simple and clean sqlite3 schema. bibliothēcula f (genitive bibliothēculae); first declension (Late Latin) small library small collection of books document database with tags and full-text-search, in a simple and clean sqlite3 schema Organise documents with tags and other metadata with the option of storing multiple files per document. See the database schema and the documentation. Uses Organise journal articles for bibliographies Organise e-books Store plain text notes with automatic full-text search […]

Read more
1 2