@rrnewton
Created September 30, 2025 15:30
Breaking down the file types of the 27317 cargo-vendored paths (files and directories)
[~/playground/scx/vendor (main)]
$ find | xargs file > /tmp/all_files.txt
$ wc /tmp/all_files.txt
27317 102247 3104103 /tmp/all_files.txt
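$ # Assumed intermediate step (not shown above): strip the leading "path:" from each line
$ sed 's/^[^:]*:[[:space:]]*//' /tmp/all_files.txt > /tmp/all_file_types.txt
$ sed 's/[0-9]//g' /tmp/all_file_types.txt | sort | uniq -c | sort -nr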
10265 ASCII text
4645 directory
3082 C source, ASCII text
2827 current ar archive
2480 Rust Source file
1144 ASCII text, with very long lines ()
856 Unicode text, UTF- text
444 JSON text data
185 C source, Unicode text, UTF- text
177 POSIX shell script, ASCII text executable
160 ASCII text, with CRLF line terminators
148 Java source, ASCII text
93 Bourne-Again shell script, ASCII text executable
91 Algol source, ASCII text
65 troff or preprocessor input, ASCII text
59 HTML document, ASCII text
51 assembler source, ASCII text
39 C source, ASCII text, with CRLF line terminators
38 exported SGML document, ASCII text
38 Dyalog APL transfer
32 data
29 empty
26 Perl module source, ASCII text
26 makefile script, ASCII text
21 Unicode text, UTF- text, with very long lines ()
21 C source, ASCII text, with very long lines ()
16 Unicode text, UTF- text, with CRLF line terminators
15 Algol source, Unicode text, UTF- text
11 C++ source, ASCII text
10 HTML document, ASCII text, with very long lines ()
9 HTML document, Unicode text, UTF- text
8 Python script, ASCII text executable
8 CSV ASCII text
8 ASCII text, with escape sequences
7 ASCII text, with no line terminators
6 Non-ISO extended-ASCII text, with LF, NEL line terminators
6 exported SGML document, Unicode text, UTF- text
5 JavaScript source, ASCII text
5 HTML document, ASCII text, with CRLF line terminators
5 GNU gettext message catalogue, Unicode text, UTF- text
4 XML . document, ASCII text
4 very short file (no magic)
4 RPM v. src
4 RPM v. bin i/x_ hello-two-.-
4 RPM v. bin i/x_ hello-debuginfo-.-
4 RPM v. bin i/x_ hello-.-
4 POSIX shell script, ASCII text executable, with very long lines ()
4 PNG image data, x , -bit colormap, non-interlaced
4 M macro processor script, ASCII text
4 Compiled terminfo entry "rxvt-color"
3 troff or preprocessor input, Unicode text, UTF- text
3 ISO- text
3 C source, Unicode text, UTF- text, with very long lines ()
3 C++ source, Unicode text, UTF- text
3 Compiled terminfo entry "linux."
3 Arhangel archive data
2 Zstandard compressed data (v.+), Dictionary ID: None
2 XZ compressed data, checksum CRC
2 Unicode text, UTF-, little-endian text, with no line terminators
2 SVG XML document
2 RPM v. bin i/x_ hello-two-debuginfo-.-
2 RPM v. bin i/x_ hello-debugsource-.-
2 PDF document, version ., page(s)
2 Nim source code, ASCII text
2 Generic INItialization configuration [env]
2 exported SGML document, ASCII text, with CRLF line terminators
2 Debian binary package (format .), with control.tar.xz , data compression xz
2 CSV Unicode text, UTF- text
2 assembler source, Unicode text, UTF- text
2 ASCII text, with very long lines (), with CRLF line terminators
2 ASCII text, with CRLF, LF line terminators
1 Zip archive data, made by v. UNIX, extract using at least v., last modified, last modified Sun, Jan ::, uncompressed size , method=deflate
1 XML . document, Unicode text, UTF- (with BOM) text
1 xbm image (x, ASCII text
1 WebAssembly (wasm) binary module version x (MVP)
1 unified diff output text, st line "<picture>", nd line " <source media="(prefers-color-scheme: dark)" srcset="assets/libbpf-logo-sideways-darkbg.png" width="%">", rd line " <img src="assets/libbpf-logo-sideways.png" width="%">", ASCII text
1 unified diff output text, st line "# Dummy program that we patch to generate a dwp file with more than GB of", nd line "# .debug_info.", rd line "", ASCII text
1 Unicode text, UTF- text, with very long lines (), with CRLF line terminators
1 Unicode text, UTF-, big-endian text, with no line terminators
1 SVG Scalable Vector Graphics image, ASCII text
1 Python script, Unicode text, UTF- text executable
1 PNG image data, x , -bit/color RGBA, non-interlaced
1 PGP public key block Public-Key (old)
1 Perl script text executable
1 PE+ executable for MS Windows . (DLL), x-, sections
1 PE executable for MS Windows . (DLL), Intel i, sections
1 PE executable for MS Windows . (console), Intel i, sections
1 Non-ISO extended-ASCII text
1 Nim source code, Unicode text, UTF- text, with very long lines ()
1 New Line Delimited JSON text data
1 Mailbox text, st line "From eafeecebcfcefbd Mon Sep :: "
1 Mailbox text, st line "From befffeebfdddddfcbe Mon Sep :: "
1 Mailbox text, st line "From beaffaafffcfeecfedfd Mon Sep :: "
1 Mailbox text, st line "From bdafeeeeacabde Mon Sep :: "
1 Mach-O -bit x_ executable, flags:<NOUNDEFS|DYLDLINK|TWOLEVEL|WEAK_DEFINES|BINDS_TO_WEAK|PIE|HAS_TLV_DESCRIPTORS>
1 Mach-O -bit arm executable, flags:<NOUNDEFS|DYLDLINK|TWOLEVEL|WEAK_DEFINES|BINDS_TO_WEAK|PIE|HAS_TLV_DESCRIPTORS>
1 Java source, ASCII text, with very long lines ()
1 JavaScript source, ASCII text, with very long lines ()
1 HTML document, Unicode text, UTF- text, with very long lines ()
1 gzip compressed data, original size modulo ^
1 gzip compressed data, last modified: Tue Dec :: , from Unix, original size modulo ^
1 gzip compressed data, last modified: Fri May :: , from Unix, original size modulo ^
1 gzip compressed data, from Unix, original size modulo ^
1 gzip compressed data, from NTFS filesystem (NT), original size modulo ^
1 GNU awk script, ASCII text executable
1 ELF -bit MSB executable, IBM S/, version (GNU/Linux), statically linked, for GNU/Linux .., not stripped, too many notes ()
1 ELF -bit LSB executable, x-, version (GNU/Linux), statically linked, for GNU/Linux .., not stripped
1 ELF -bit LSB executable, Intel i, version (GNU/Linux), statically linked, for GNU/Linux .., not stripped
1 ELF -bit LSB executable, -bit PowerPC or cisco , OpenPOWER ELF V ABI, version (GNU/Linux), statically linked, for GNU/Linux .., not stripped, too many notes ()
1 ELF -bit LSB executable, ARM aarch, version (GNU/Linux), statically linked, for GNU/Linux .., not stripped
1 DOS batch file, ASCII text, with CRLF line terminators
1 DCL command file, ASCII text
1 CSV ISO- text
1 Compiled terminfo entry "vt"
1 Compiled terminfo entry "screen-color"
1 Compiled terminfo entry "screen"
1 Compiled terminfo entry "rxvt-xpm"
1 Compiled terminfo entry "rxvt-cygwin-native"
1 Compiled terminfo entry "rxvt-cygwin"
1 Compiled terminfo entry "rxvt-basic"
1 Compiled terminfo entry "rxvt"
1 Compiled terminfo entry "linux-vt"
1 Compiled terminfo entry "linux-nic"
1 Compiled terminfo entry "linux-m"
1 Compiled terminfo entry "linux-lat"
1 Compiled terminfo entry "linux-koir"
1 Compiled terminfo entry "linux-koi"
1 Compiled terminfo entry "linux-color"
1 Compiled terminfo entry "linux-c-nc"
1 Compiled terminfo entry "linux-c"
1 Compiled terminfo entry "linux-basic"
1 Compiled terminfo entry "linux.."
1 Compiled terminfo entry "linux"
1 Compiled terminfo entry "dumb"
1 Compiled -bit terminfo entry "xterm-color"
1 bzip compressed data, block size = k
1 automake makefile script, ASCII text
1 ASCII text, with very long lines (), with escape sequences
1 Apple DiskCopy . image \, bytes, x tag size, GCR CLV ssdd (k), format
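
The tally above can be reproduced from the saved `file` output with a small helper. This is a minimal sketch, not the exact commands used for the listing: `summarize_types` is a hypothetical name, and it assumes each input line has the form `path: description` with no colon inside the path. Stripping the digits is what collapses variants such as different UTF encodings or line lengths into one bucket, which is why entries in the listing read `UTF- text` or `with very long lines ()`.

```shell
# Hypothetical helper: summarize saved `file` output read from stdin.
# Assumes one "path: description" entry per line and no ':' inside the path.
summarize_types() {
    # 1. Drop the path and the padding after the first colon.
    # 2. Remove all digits so near-identical descriptions collapse together.
    # 3. Count identical descriptions and list the most frequent first.
    sed -e 's/^[^:]*:[[:space:]]*//' -e 's/[0-9]//g' \
        | sort | uniq -c | sort -nr
}
```

Usage would be `summarize_types < /tmp/all_files.txt`. If any vendored path contains whitespace, `find . -print0 | xargs -0 file` is a safer way to generate the input than a plain `find | xargs file`.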