![]() ![]() New collections can also be bought with real money. ![]() For each completed puzzle the player earns gold which is the in-game currency and can be used to acquire new collections. Each collection consists of a certain number of puzzles with three available from the start while the rest has to be unlocked by completing previous ones. Most of these need to be paid for but there are also some that are free. The jigsaws are sorted into themed collections and when starting out the player has access to two of these but there are more available for download. Therefore, further improvements on P3 could have a significant impact on many program synthesis areas.Microsoft Jigsaw is free-to-play collection of jigsaw puzzles. In a small user study, we find a positive correlation between puzzle-solving performance and coding experience, and between the puzzle difficulty for humans and AI solvers. Codex performs best, solving up to 18% of 397 test problems with a single try and 80% of the problems with 1,000 tries per problem. ![]() We develop baseline enumerative program synthesis, GPT-3 and Codex solvers that are capable of solving puzzles-even without access to any reference solutions-by learning from their own past solutions. The dataset is comprehensive in that it spans problems of a range of difficulties and domains, ranging from trivial string manipulation problems, to classic programming puzzles (e.g., Tower of Hanoi), to interview/competitive-programming problems (e.g., dynamic programming), to longstanding open problems in algorithms and mathematics (e.g., factoring). They do not require an answer key or input/output examples, nor do they depend on natural language understanding. The puzzles are objective in that each one is specified entirely by the source code of its verifier $f$, so evaluating $f$ is all that is needed to test a candidate solution. Each puzzle is defined by a short Python program $f$, and the goal is to find an input which makes $f$ return True. Abstract: We introduce a new type of programming challenge called programming puzzles, as an objective and comprehensive evaluation of program synthesis, and release an open-source dataset of Python Programming Puzzles (P3). ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |