LaTeX templates for master's and doctoral theses at South China University of Technology

TeX 364 64 Updated Jan 7, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 7,795 809 Updated Mar 21, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,863 2,206 Updated Feb 1, 2025

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Python 26,092 3,285 Updated Sep 24, 2024

[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding

Python 74 Updated Mar 12, 2025

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.

Python 72,265 7,839 Updated Mar 24, 2025

The research collection of typography

135 14 Updated Mar 25, 2025

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Python 2,073 624 Updated Aug 9, 2023

CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View

Python 361 41 Updated Jun 30, 2022
Python 26 Updated Nov 22, 2024

Official inference repo for FLUX.1 models

Python 21,017 1,488 Updated Feb 6, 2025

Official PyTorch implementation for "Semantically Coherent Montages by Merging and Splitting Diffusion Paths", presenting the Merge-Attend-Diffuse operator (ECCV24)

Python 13 1 Updated Sep 2, 2024

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Python 773 40 Updated Aug 13, 2024
Python 77 4 Updated Dec 29, 2024

Microsoft TTS text-to-speech with audio download

JavaScript 889 199 Updated Mar 25, 2025

Oracle Bone Script data collected by VLRLab of HUST

Python 42 1 Updated Sep 2, 2024

[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Python 401 45 Updated Jan 28, 2025

Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024

Python 44 7 Updated Oct 24, 2024

Font files available from Google Fonts, and a public issue tracker for all things Google Fonts

HTML 18,649 2,701 Updated Mar 24, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,311 641 Updated Feb 10, 2025

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool.

Python 15,266 1,617 Updated Feb 21, 2025
Python 1 Updated Aug 27, 2024

Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)

Python 449 32 Updated Oct 18, 2024

UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models

Python 221 17 Updated Feb 14, 2025

Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld

Python 54 Updated Oct 4, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 5,027 527 Updated Mar 25, 2025

Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation

Python 374 35 Updated Oct 29, 2024