Commit graph

72 commits

Author SHA1 Message Date
Zafir Stojanovski
256eb71555 use get_data_file_path to read file contents 2025-02-06 10:12:51 +01:00
Zafir Stojanovski
7827025f7b docs 2025-02-06 00:12:58 +01:00
Zafir Stojanovski
b4f412956e group anagrams env 2025-02-06 00:11:07 +01:00
Andreas Köpf
c6fbff7d8f Merge pull request #52 from cavit99/main
Improve Word Ladder and add complete example suite
2025-02-03 15:16:29 +01:00
Cavit Erginsoy
aff0fecef4 lint 2025-02-03 11:35:30 +00:00
Cavit Erginsoy
9b1068ea39 Merge remote-tracking branch 'upstream/main' 2025-02-03 07:44:32 +00:00
Cavit Erginsoy
d5065955a8 Refactor word ladder generation with improved validation and graph-based path finding
- Enhanced configuration validation with size and length constraints
- Implemented graph-based neighbor computation and caching
- Simplified path finding algorithm with more robust length checking
- Added more flexible word set loading with configurable length ranges
- Improved error handling for dataset generation
2025-02-03 07:21:43 +00:00
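Note on d5065955a8: the message mentions graph-based neighbor computation, caching, and path finding, but the listing does not show the code. The following is only a minimal sketch of one common way to do this, assuming words are grouped by wildcard pattern and the path is found with breadth-first search. The names `build_neighbors` and `shortest_ladder` are hypothetical and are not taken from the repository.

```python
from collections import defaultdict, deque


def build_neighbors(words):
    """Map each word to the set of words differing from it by exactly one letter.

    Hypothetical helper: words sharing a wildcard pattern such as "C_RD"
    are one-letter neighbors of each other.
    """
    buckets = defaultdict(list)
    for word in sorted(words):
        for i in range(len(word)):
            buckets[word[:i] + "_" + word[i + 1:]].append(word)

    neighbors = defaultdict(set)
    for group in buckets.values():
        for a in group:
            for b in group:
                if a != b:
                    neighbors[a].add(b)
    return neighbors


def shortest_ladder(start, end, neighbors):
    """Breadth-first search for a shortest ladder; returns None if there is none."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == end:
            return path
        # sorted() keeps the traversal order, and so the result, repeatable
        for nxt in sorted(neighbors.get(path[-1], ())):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None


words = {"COLD", "CORD", "CARD", "WARD", "WARM", "WORD", "WORM"}
graph = build_neighbors(words)  # computed once, reused for every query
ladder = shortest_ladder("COLD", "WARM", graph)
```

Building the neighbor map once and reusing it across queries is the kind of caching the message alludes to; the real module may organize this differently.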
Joe Norton
731d36f43f add palindrome score_answer
add palindrome score_answer & test
2025-02-02 18:04:47 -08:00
Cavit Erginsoy
da11def5c7 Improve efficiency and reduce plural bias in word ladder generation
- Precomputed sorted word lists for each word length (stored in self.words_lists) to avoid redundant sorting on every _generate_word_pair call.
- Updated _generate_word_pair to utilize the cached sorted list, significantly improving computational efficiency.
- Implemented weighted random sampling for 5-letter words, giving words ending with 'S' a lower weight (0.5) to reduce bias without completely filtering them out.
2025-02-01 14:37:21 +00:00
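Note on da11def5c7: the message describes two ideas, caching a sorted word list per length and down-weighting words ending in 'S' when sampling. A minimal sketch of both follows, assuming the standard-library `random.Random.choices` is used for the weighted draw. The function names and the `plural_weight` parameter are hypothetical; only the 0.5 weight and the `words_lists` idea come from the commit message.

```python
import random


def build_word_lists(words):
    """Sort the words of each length once so later calls can reuse the lists."""
    by_length = {}
    for word in words:
        by_length.setdefault(len(word), []).append(word)
    return {length: sorted(group) for length, group in by_length.items()}


def pick_word(candidates, rng, plural_weight=0.5):
    """Draw one word, giving words that end in 'S' a reduced weight."""
    weights = [plural_weight if w.endswith("S") else 1.0 for w in candidates]
    return rng.choices(candidates, weights=weights, k=1)[0]


words_lists = build_word_lists(["CATS", "DOGS", "BIRD", "FISH", "HOUSE", "WORDS"])
rng = random.Random(42)  # seeded so repeated runs give the same draw
word = pick_word(words_lists[4], rng)
```

A weight of 0.5 makes a word ending in 'S' half as likely to be drawn as any other single word, so such words are de-emphasized rather than filtered out.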
Cavit Erginsoy
7e29fc12e1 refactor: Clarify word ladder question 2025-02-01 14:27:06 +00:00
Joe Norton
d0d84ae82a lint 2025-01-31 18:45:52 -08:00
Joe Norton
f75d9a2829 add palindrome_generation 2025-01-31 18:45:52 -08:00
Andreas Koepf
b49167c61c lint 2025-01-31 12:16:08 +01:00
Andreas Koepf (aider)
7d911a8c25 fix: Improve base conversion logic for non-standard bases 2025-01-31 12:09:32 +01:00
Andreas Koepf
a577f7cdf6 use sorted() for repeatable generation outputs (e.g. GALLERY.md) 2025-01-30 23:33:43 +01:00
Andreas Koepf
5ae329becd lint 2025-01-30 23:14:32 +01:00
Cavit Erginsoy
df3c4580ee INIT 2025-01-30 21:32:46 +00:00
Cavit Erginsoy
d58dfd0bd4 INIT 2025-01-30 19:42:58 +00:00
Andreas Koepf
cca03d4b09 add newline to word sorting template 2025-01-27 16:57:49 +01:00
Andreas Koepf (aider)
f91417aaa9 feat: Clarify word sorting instructions with ASCII/Unicode ordering and output format 2025-01-26 22:29:57 +01:00
Andreas Koepf (aider)
9bb5a94c29 refactor: Change word sorting answer format from list string to comma-separated string 2025-01-26 22:23:18 +01:00
Andreas Koepf
cae7f0f98b min python 3.11 to support StrEnum 2025-01-26 22:17:43 +01:00
Andreas Koepf
ecbb155184 fix unit tests, lower python dependency to 3.9 2025-01-26 16:55:17 +01:00
Andreas Koepf (aider)
a2f9bf0535 fix: Import missing 're' module for regex word extraction 2025-01-26 16:14:23 +01:00
Andreas Koepf (aider)
f75471119c feat: Add word sorting task generation with text transformations 2025-01-26 16:14:10 +01:00
Andreas Koepf (aider)
648ad26bd3 feat: Add word sorting dataset with configurable text transformations 2025-01-26 16:11:32 +01:00
Andreas Koepf (aider)
49533e10ad refactor: Update default sentence length constraints to 3-20 words 2025-01-26 15:57:02 +01:00
Andreas Koepf (aider)
b03b02f3a5 refactor: Update sentence extraction regex to preserve ending punctuation 2025-01-26 15:56:03 +01:00
Andreas Koepf (aider)
fa9465e6a9 refactor: Rename num_of_words_in_sentence and add max_words_in_sentence config 2025-01-26 15:46:21 +01:00
Andreas Koepf
9abea574ed refactor: Update sentence reordering prompt to be more descriptive 2025-01-26 15:46:19 +01:00
Andreas Köpf
684a69603f Merge branch 'main' into koko/scramble 2025-01-26 15:41:25 +01:00
Andreas Koepf
7977895275 rename word_reversal.py -> word_sequence_reversal.py 2025-01-26 11:57:50 +01:00
Andreas Koepf (aider)
cf864d523a refactor: Update import for word sequence reversal module 2025-01-26 11:53:48 +01:00
Andreas Koepf (aider)
ae79483609 fix: Correct WordReversalConfig references to WordSequenceReversalConfig 2025-01-26 11:52:25 +01:00
Andreas Koepf (aider)
5e57848fa2 refactor: Rename WordReversalDataset to WordSequenceReversalDataset 2025-01-26 11:52:15 +01:00
Andreas Koepf (aider)
6811c2ba04 feat: Add SpellBackward imports and exports to algorithmic package 2025-01-26 11:48:18 +01:00
Andreas Koepf (aider)
71a6800d27 feat: Add spell_backward.py module for word reversal task generation 2025-01-26 11:46:07 +01:00
Andreas Koepf (aider)
4908c06cb8 refactor: Move SpellBackwardDataset to separate file 2025-01-26 11:44:27 +01:00
Andreas Koepf (aider)
65d5a5786b feat: Add SpellBackwardDataset with word reversal and length filtering 2025-01-26 11:40:47 +01:00
abdulhakeem
03d4a5d8ac Make more tiny correction 2025-01-25 23:25:55 -06:00
abdulhakeem
acc3f5269a Correct logic for number of words in sentence 2025-01-25 23:22:16 -06:00
abdulhakeem
05004b7a51 Add parameters to _generate_sentence_dataset 2025-01-25 23:17:39 -06:00
abdulhakeem
9d025b43fa Ensure only words are considered 2025-01-25 23:08:41 -06:00
abdulhakeem
d00856cab4 Add assertion to ensure number of words in sentence is positive 2025-01-25 23:02:17 -06:00
abdulhakeem
09634db2cf Add sentence reordering and unit tests to validate it 2025-01-25 22:52:35 -06:00
Andreas Koepf
31a5b5cb76 formatting 2025-01-25 18:51:28 +01:00
Andreas Koepf
5fc0b1bdc3 remove old files 2025-01-25 18:51:07 +01:00
Andreas Koepf (aider)
7c61924335 refactor: Rename UnscrambleWordsDataset to LetterJumbleDataset 2025-01-25 18:37:42 +01:00
Andreas Koepf (aider)
e60d211043 feat: Add consecutive words option and ensure minimum word swap in UnscrambleWords 2025-01-25 18:29:02 +01:00
Andreas Koepf (aider)
f531d6b513 feat: Add unscramble_words dataset with configurable word scrambling 2025-01-25 18:21:31 +01:00