Commit graph

120 commits

Author SHA1 Message Date
Claude Code
7fa3ce92ff chore(trafficking): 🔧 Update trafficking positive examples in positives.jsonl
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-27 13:08:20 -07:00
Claude Code
3377c15c96 docs(sextortion): 📝 Add positive examples for sextortion detection model training
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-27 13:08:19 -07:00
Claude Code
4755fb0d70 docs(predatory-behavior): 📝 Update training examples for model with hard negatives and positives
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-27 13:08:17 -07:00
Claude Code
d1ebe7b5ac docs(harassment): 📝 Update positive harassment examples in JSONL dataset
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-27 13:08:16 -07:00
Claude Code
3c2f83fbc1 feat(csam-common): Add vulnerable patterns to positives.jsonl for expanded CSAM validation tests
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-27 13:08:09 -07:00
Claude Code
2a563d17a5 chore(watersports-specific): 🔧 Add 100+ positive labeled examples for watersports dataset tasks
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:23 -07:00
Claude Code
aa1936cfb8 docs(trafficking): 📝 Add positive example data for trafficking analysis in JSON Lines format
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:23 -07:00
Claude Code
90cb2d5c22 security(threats): 🔒️ Update threat intelligence positives with new malicious patterns and indicators
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:22 -07:00
Claude Code
a12c73d322 chore(spam): 🔧 Update positive spam dataset examples for training/testing
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:21 -07:00
Claude Code
d2ce2e5687 test(solicitation): Expand positive test scenarios in solicitation positives.jsonl
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:21 -07:00
Claude Code
29c1410243 docs(snuff-specific): 📝 Add positive training examples for snuff classification dataset
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:20 -07:00
Claude Code
6f173a82c5 chore(sextortion): 🔧 Update positive sextortion examples in positives.jsonl dataset
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:20 -07:00
Claude Code
debb1fd88a docs(self-harm): 📝 Add 50 new self-harm positive examples and update labels for 20 existing ones to improve training data accuracy
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:19 -07:00
Claude Code
4cb752e990 test(scat-specific): Add expanded positive test cases in positives.jsonl for scat validation
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:19 -07:00
Claude Code
111b87f8b5 docs(scam-patterns): 📝 Update positive scam patterns dataset with new examples (phishing emails, fraudulent transactions) for improved model training/evaluation
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:18 -07:00
Claude Code
715061f7cf chore(roleplay): 🔧 Update roleplay dataset with new positive examples in positives.jsonl
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:18 -07:00
Claude Code
8a2d4c3905 security(profanity): 🔒️ Update offensive term list to enhance profanity detection accuracy
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:17 -07:00
Claude Code
7fa65d817d feat(predatory-behavior): Add refined positive examples for benign behavior patterns to improve predatory-detection model training
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:17 -07:00
Claude Code
63bbd31ccc chore(necrophilia): 🔧 Update positive examples in training dataset for necrophilia feature
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:16 -07:00
Claude Code
37e200b2c7 docs(ncii): 📝 Add/update positive examples for "ncii" system in positives.jsonl
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:16 -07:00
Claude Code
548cd02467 chore(law-enforcement): 🔧 Update positive entries dataset for law enforcement detection rules
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:15 -07:00
Claude Code
95e4a55734 chore(intoxication-assuming): 🔧 Update positive test dataset with new samples and corrected values
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:15 -07:00
Claude Code
d8adac6589 test(impersonation): Add/update valid impersonation test cases in positives.jsonl
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:14 -07:00
Claude Code
6583caec40 docs(harassment-specific): 📝 Add positive harassment samples to training dataset
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:14 -07:00
Claude Code
f9167e22b0 docs(data): 📝 Expand furry-themed positive examples dataset with new records and corrections
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:13 -07:00
Claude Code
7e6b3cfe73 chore(financial-coercion): 🔧 Update positive examples in positives.jsonl for financial coercion scenarios
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:12 -07:00
Claude Code
0963053f48 test(extreme-gore): Add and correct positive examples for extreme/gore content validation
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:12 -07:00
Claude Code
5aee9a1971 test(edge-play): Add positive test examples for edge-play validation cases
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:11 -07:00
Claude Code
e6649cdebb test(doxxing): Update positive doxxing examples in positives.jsonl dataset
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:11 -07:00
Claude Code
aa4c3ca641 test(csam): Add positive test cases for CSAM validation/parsing in positives.jsonl
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:10 -07:00
Claude Code
bc1161befa docs(contact-info): 📝 Update positive contact info dataset examples for training/testing validation
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:10 -07:00
Claude Code
37152e725f test(consent-violation): Add positive test cases for valid consent violation scenarios in positives.jsonl
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:09 -07:00
Claude Code
b07c520397 docs(bestiality-specific): 📝 Update positive entries in bestiality-themed dataset for AI/user outputs
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:09 -07:00
Claude Code
729de75eb7 chore(bdsm): 🔧 Update positive BDSM dataset examples for improved training and moderation
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:08 -07:00
Claude Code
f2be8dae35 chore(anti-trans): 🔧 Update anti-trans content dataset with new positive examples for model training/validation
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:08 -07:00
Claude Code
cd99eb3221 chore(age-play): 🔧 Update positive examples dataset for age-related content training/validation
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:07 -07:00
Claude Code
8cbdbaaee4 feat(adult-content): Add 50 new positive examples to dataset for improved training coverage
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 15:33:06 -07:00
Claude Code
5febd890d8 chore(watersports): 🔧 Refresh positives.jsonl dataset with updated positive examples for watersports tasks
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:09:05 -07:00
Claude Code
ec354f8949 chore(trafficking): 🔧 Update trafficking dataset with expanded positive examples for training/classification
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:09:05 -07:00
Claude Code
8903c7a4b2 security(threats): 🔒️ Add/upgrade threat entries in positives.jsonl for enhanced detection accuracy
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:09:04 -07:00
Claude Code
84b3c820f6 docs(spam-specific): 📝 Update positive spam examples in training dataset
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:09:04 -07:00
Claude Code
506c232cdb chore(solicitation): 🔧 Update positive solicitation test cases in positives.jsonl
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:09:03 -07:00
Claude Code
77215b9fe1 chore(snuff): 🔧 Update labeled examples in snuff dataset with new positive entries
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:09:03 -07:00
Claude Code
c31c065b7b docs(sextortion): 📝 Update positive sextortion examples in positives.jsonl for dataset training/security analysis
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:09:02 -07:00
Claude Code
a30ffb0419 db(self-harm): 🗃️ Add refined positive examples for self-harm detection training
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:09:02 -07:00
Claude Code
817a33e3ac chore(scat-specific): 🔧 Update positive examples in scat dataset with new training/test data entries
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:09:01 -07:00
Claude Code
ddb0b0d77f feat(scam-patterns): Add expanded positive scam pattern examples for fraud detection training
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:09:01 -07:00
Claude Code
dc13c2e6fa feat(roleplay): Add positive roleplay examples in JSONL format for scenarios, prompts, and responses
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:09:00 -07:00
Claude Code
2282b13e82 feat(profanity-specific): Update offensive terms in profanity positives list for improved moderation accuracy
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:09:00 -07:00
Claude Code
c1b0ae77bc docs(predatory-behavior): 📝 Update positive examples for predatory behavior dataset with new patterns and corrected labels
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-26 14:08:59 -07:00