roslein commited on
Commit
28e750b
·
verified ·
1 Parent(s): d39be35

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +40 -3
README.md CHANGED
@@ -1,3 +1,40 @@
1
- ---
2
- license: cc-by-nc-4.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ base_model:
4
+ - CohereLabs/c4ai-command-a-03-2025
5
+ pipeline_tag: text-generation
6
+ ---
7
+
8
+ ## Model Card for Abliterated Cohere Labs Command A
9
+
10
+ ### Model Summary
11
+ This is an abliterated version of Cohere Labs Command A, a 111 billion parameter model originally optimized for enterprise use. The model has been modified using the Abliteration technique to alter its behavior by removing specific directions in the model's weight space.
12
+
13
+ ### Original Model Information
14
+ - Developed by: Cohere and Cohere Labs - **CohereLabs/c4ai-command-a-03-2025**
15
+
16
+ ### Original License: CC-BY-NC, with adherence to Cohere Lab's Acceptable Use Policy
17
+
18
+ ### Base Model: c4ai-command-a-03-2025
19
+ - Model Size: 111 billion parameters
20
+ - Context Length: 256K (configured for 128K in Hugging Face)
21
+
22
+ ### Abliteration Details
23
+ - Abliteration Method: Applied using the script from Abliteration-by-Transformers *https://github.com/JanRoslein/Abliteration-by-Transformers.git*
24
+ - Abliteration Parameters: Used --proportional-scaling and --max-scale-factor with value of 2.25
25
+ - Significantly Modified Layers:
26
+ layer_32_attn: Change magnitude = 2.171875 (1.4103%)
27
+ layer_32_mlp: Change magnitude = 3.546875 (1.3748%)
28
+ layer_62_attn: Change magnitude = 3.828125 (3.7531%)
29
+ layer_62_mlp: Change magnitude = 6.656250 (2.6206%)
30
+
31
+ ### Known Limitations
32
+ - The model may produce artifacts in some languages due to the significant modifications to specific layers.
33
+ - Performance characteristics differ from the original model, particularly in areas targeted by the abliteration process.
34
+
35
+ ### Usage Notes
36
+ - The abliteration process may have altered the model's behavior in ways that could affect its reliability and output quality.
37
+ - Users should be aware that the original model's licensing terms still apply to this derivative work.
38
+
39
+ ### Technical Details
40
+ The abliteration technique works by identifying and modifying specific directions in the model's weight space that are associated with certain behaviors. The proportional scaling approach used in this case applies different modification intensities to different layers based on their contribution to the targeted behavior.