duongve commited on
Commit
4b5a40b
·
verified ·
1 Parent(s): 2018b22

Upload 2 files

Browse files
Files changed (3) hide show
  1. .gitattributes +1 -0
  2. Example/Demo_v2.png +3 -0
  3. README.md +63 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ Example/Demo_v2.png filter=lfs diff=lfs merge=lfs -text
Example/Demo_v2.png ADDED

Git LFS Details

  • SHA256: 15cfe114fb67ca72492d4cef5030ec30144fe225f2f88e28330fe5243d21ba28
  • Pointer size: 132 Bytes
  • Size of remote file: 7.75 MB
README.md ADDED
@@ -0,0 +1,63 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ pipeline_tag: text-to-image
3
+ license: apache-2.0
4
+ base_model:
5
+ - neta-art/Neta-Lumina
6
+ - Alpha-VLLM/Lumina-Image-2.0
7
+ tags:
8
+ - stable-diffusion
9
+ - text-to-image
10
+ - comfyui
11
+ - diffusion-single-file
12
+ ---
13
+
14
+ # NetaYume Lumina Image v2.0
15
+ ![NetaYume Lumina Image v2.0](./Example/Demo_v2.png)
16
+ This model is based on [Lumina-Image-2.0](https://huggingface.co/Alpha-VLLM/Lumina-Image-2.0), which is a DIT model with 2 billions parameter flow-based diffusion transformer. For more information, visit [here](https://github.com/Alpha-VLLM/Lumina-Image-2.0).
17
+
18
+ ---
19
+ **I. Introduction**
20
+
21
+ NetaYume Lumina is a text-to-image model fine-tuned from [Neta Lumina](https://huggingface.co/neta-art/Neta-Lumina), a high-quality anime-style image generation model developed by [Neta.art Lab](https://huggingface.co/neta-art). It builds upon [Lumina-Image-2.0](https://huggingface.co/Alpha-VLLM/Lumina-Image-2.0), an open-source base model released by the [Alpha-VLLM](https://huggingface.co/Alpha-VLLM) team at Shanghai AI Laboratory.
22
+
23
+ This model was trained with the goal of not only generating realistic human images but also producing high-quality anime-style images. Despite being fine-tuned on a specific dataset, it retains a significant amount of knowledge from the base model.
24
+
25
+ **Key Features:**
26
+ - **High-Quality Anime Generation**: Generates detailed anime-style images with sharp outlines, vibrant colors, and smooth shading.
27
+ - **Improved Character Understanding**: Better captures characters, especially those from the Danbooru dataset, resulting in more coherent and accurate character representations.
28
+ - **Enhanced Fine Details**: Accurately generates accessories, clothing textures, hairstyles, and background elements with greater clarity.
29
+
30
+
31
+ - The file NetaYume_Lumina_v2_all_in_one.safetensors is an all-in-one file that contains the necessary weights for the VAE, text encoder, and image backbone to be used with ComfyUI.
32
+
33
+ ---
34
+
35
+ 2. Model Components & Training Details
36
+ - **Text Encoder**: Pre-trained **Gemma-2-2b**
37
+ - **Variational Autoencoder**: Pre-trained **Flux.1 dev's VAE**
38
+ - **Image Backbone**: Fine-tune **NetaLumina's Image Backbone**
39
+
40
+ ---
41
+
42
+ 3. Suggestion
43
+
44
+ **System Prompt:** This help you generate your desired images more easily by understanding and aligning with your prompts.
45
+
46
+ For anime-style images using Danbooru tags:
47
+
48
+ You are an assistant designed to generate anime images based on textual prompts.
49
+
50
+ You are an assistant designed to generate high-quality images based on user prompts and danbooru tags.
51
+
52
+ **Recommended Settings**
53
+ - CFG: 4–8
54
+ - Sampling Steps: 40-50
55
+ - Sampler:
56
+ - Euler a (with scheduler: normal)
57
+ - res_multistep (with scheduler: linear_quadratic)
58
+
59
+ ---
60
+ 4. Acknowledgments
61
+ - [narugo1992](https://huggingface.co/narugo) – for the invaluable Danbooru dataset
62
+ - [Alpha-VLLM](https://huggingface.co/Alpha-VLLM) - for creating the a wonderful model!
63
+ - [Neta Lumina](https://huggingface.co/neta-art/Neta-Lumina) and his team – for openly sharing a wonderful model.