-
Notifications
You must be signed in to change notification settings - Fork 5
Expand file tree
/
Copy pathBitNet.yaml
More file actions
104 lines (85 loc) · 2.99 KB
/
BitNet.yaml
File metadata and controls
104 lines (85 loc) · 2.99 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
---
# Thank you for contributing!
# In filling out this yaml file, please follow the criteria as described here:
# https://osai-index.eu/contribute
# You're free to build on this work and reuse the data. It is licensed under CC-BY 4.0, with the
# stipulation that attribution should come in the form of a link to https://osai-index.eu/
# and a citation to the peer-reviewed paper in which the dataset & criteria were published:
# Liesenfeld, A. and Dingemanse, M., 2024. Rethinking open source generative AI: open-washing and the EU AI Act. In Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (pp. 1774-1787).
# Organization tags:
# - National origin: United States
# - Contributor type: Non-academic (American Big Tech)
system:
name: BitNet
link: https://huggingface.co/microsoft/bitnet-b1.58-2B-4T
type: text
performanceclass: full
basemodelname:
endmodelname: BitNet-b1.58-2B4T
endmodellicense: MIT License
releasedate: 2025-04
notes: Native 1-bit LLM.
org:
name: Microsoft
link: https://huggingface.co/microsoft
notes: Major technology company.
# availability:
datasources_basemodel:
class: closed
link: https://arxiv.org/pdf/2504.12285
notes: Pretraining data said to include DCLM and FineWeb-EDU. No full description provided.
datasources_endmodel:
class: closed
link: https://arxiv.org/pdf/2504.12285
notes: Fine-tuning data said to include WildChat, LMSYS-Chat-1M, WizardLM-Evol-Instruct, and SlimOrca. No full description provided.
weights_basemodel:
class: closed
link:
notes: Weights for base model not made available.
weights_endmodel:
class: open
link: https://huggingface.co/microsoft/bitnet-b1.58-2B-4T
notes: Weights made available on HuggingFace.
trainingcode:
class: closed
link: https://github.com/microsoft/BitNet
notes: Only inference code found.
# documentation
code:
class: closed
link:
notes: No code, so no documentation
hardware_architecture:
class: closed
link: https://arxiv.org/pdf/2504.12285
notes: Only inference hardware information found.
preprint:
class: open
link: https://arxiv.org/pdf/2504.12285
notes: Preprint made available on arXiv.
paper:
class: closed
link:
notes: No peer-reviewed paper found.
modelcard:
class: partial
link: https://huggingface.co/microsoft/bitnet-b1.58-2B-4T
notes: Model card provides thorough information regarding architecture, fine-tuning, and evaluation. Information regarding training is provided to a lesser degree.
datasheet:
class: closed
link:
notes: No datasheet found.
# access
package:
class: closed
link:
notes: No package found.
api:
class: closed
link: https://bitnet-demo.azurewebsites.net/
notes: Demo available. No API found.
metaprompt: closed
licenses:
class: open
link: https://huggingface.co/microsoft/bitnet-b1.58-2B-4T/blob/main/LICENSE
notes: MIT License, an OSI-approved license.