-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Working version of faithfulness dashboard with SAE directions!
- Loading branch information
1 parent
6407c04
commit 7d346f4
Showing
7 changed files
with
523 additions
and
256 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
# Faithfulness | ||
|
||
This module demonstrates that SAE features are faithful and that the underlying vision model does in fact depend on the features to make its predictions. | ||
|
||
It demonstrates this through an interactive dashboard and through larger-scale quantitative experiments. | ||
|
||
## Dashboard | ||
|
||
First, record activations for the ADE20K dataset. | ||
|
||
```sh | ||
uv run python -m saev activations \ | ||
--model-group clip \ | ||
--model-ckpt ViT-B-16/openai \ | ||
--d-vit 768 \ | ||
--n-patches-per-img 196 \ | ||
--layers -2 \ | ||
--dump-to /local/scratch/$USER/cache/saev \ | ||
--n-patches-per-shard 2_4000_000 \ | ||
data:ade20k-dataset \ | ||
--data.root /research/nfs_su_809/workspace/stevens.994/datasets/ade20k/images | ||
``` |
Oops, something went wrong.