A lightweight, extensible, high-level, universal code parser built on top of tree-sitter
Updated Dec 2, 2021 - Python
(1) Code mining and clickable code paths in VS Code and the system terminal: https://github.com/qualiu/vscode-msr (2) Clickable terminal integration for Visual Studio (2012~2022+): https://github.com/qualiu/msrTools/tree/master/code/vs-conemu (3) UI helper for msr/nin: https://github.com/qualiu/msrUI
(1) Find definitions, mine code, and process files via menu, mouse, or terminal in VS Code, or from the command line outside VS Code. (2) Integration with VS Code, other IDEs, and the system terminal. (3) Visual Studio (such as VS2022) terminal integration with clickable file paths: https://github.com/qualiu/msrTools/blob/master/code/vs-conemu/README.md
Sievio turns GitHub, local repos, and web PDFs into clean JSONL for LLM pretraining, fine-tuning, and RAG. It offers structure-aware chunking, reliable Unicode decoding, pluggable QC and safety checks, plus optional dataset cards and deduplication.
Analyzing and Supporting Adaptation of Online Code Examples (ICSE 2019)
Fetch all kernels written for competitions from Kaggle.