A Python Interpreter Written in Python

(aosabook.org)

87 points | by xk3 4 days ago ago

23 comments

BoppreH 3 hours ago ago

> Byterun is a Python interpreter written in Python. This may strike you as odd, but it's no more odd than writing a C compiler in C.
I'm not so sure. The difference between a self-hosted compiler and a circular interpreter is that the compiler has a binary artifact that you can store.
With an interpreter, you still need some binary to run your interpreter, which will probably be CPython, making the new interpreter redundant. And if you add a language feature to the custom interpreter, and you want to use that feature in the interpreter itself, you need to run the whole chain at runtime: CPython -> Old Interpreter That Understand New Feature -> New Interpreter That Uses New Feature -> Target Program. And the chain only gets longer, each iteration exponentially slower.
Meanwhile with a self-hosted compiler, each iteration is "cached" in the form a compiled binary. The chain is only in the history of the binary, not part of the runtime.
---
Edit since this is now a top comment: I'm not complaining about the project! Interpreters are cool, and this is genuinely useful for learning and experimentation. It's also nice to demystify our tools.

[-]
- gwerbin 2 hours ago ago
  
  PyPy handled this by implementing PyPy in a restricted minimal subset of Python that they called RPython, and that seemed to work out well for them.
- SJC_Hacker an hour ago ago
  
  This is the case only if the new interpreter does not simply include the layer that the old interpreter has for translating bytecode to native instructions. Once you have that, you can simply bootstrap any new interpreters from previous ones. Even in the case of supporting new architectures, you can still work at the Python level to produce the necessary binary, although the initial build would have to be done on an already supported architechture.
anitil 7 hours ago ago

Oooh it's a bytecode interpreter! I was wondering how they'd fit a parser/tokenizer in 500 lines unless the first was `import tokenizer, parser`. And it looks like 1500ish lines according to tokei
I think because python is a stack-based interpreter this is a really great way to get some exposure to how it works if you're not too familiar with C. A nice project!
blueybingo 4 hours ago ago

the article glosses over something worth pausing on: the `getattr` trick for dispatching instructions (replacing the big if-elif chain) is actaully a really elegant pattern that shows up in a lot of real interpreters and command dispatchers, not just toy ones -- worth studying that bit specifically if you're building anything with extensible command sets.
vachanmn123 3 hours ago ago

Very well written! Everyone used to tell me during Uni that stacks are used for running programs, never ACTUALLY understood where or how.
tekknolagi 7 hours ago ago

See also https://github.com/nedbat/byterun and https://github.com/rocky/x-python

[-]
- bjoli 7 hours ago ago
  
  And, in some ways, PyPy. I still think it is the sanest way to implement Python.
  It makes me sad that I have to write C to make any meaningful changes to Python. Same goes for ruby. Rubinius was such a nice project.
  Hacking on schemes and lisps made me realize how much more fun it is when the language is implemented in the language itself. It also makes sure you have the right abstractions for solving a bunch of real problems.
  
  [-]
  - anitil 7 hours ago ago
    
    > And, in some ways, PyPy
    What do you mean by that? I'm not familiar with PyPy
    
    [-]
    - nxpnsv 7 hours ago ago
      
      PyPy is python implemented in python. It is fast.
      
      [-]
      - notpushkin 6 hours ago ago
        
        https://pypy.org/
        It lags behind CPython in features and currently only supports Python versions up to 3.11. There was a big discussion a month ago: https://news.ycombinator.com/item?id=47293415
        But you can help! https://pypy.org/howtohelp.html
        https://opencollective.com/pypy
      - Doxin 6 hours ago ago
        
        PyPy is python implemented in RPython, which is technically a python subset. It's so restricted it might as well be a different language though.
        
        [-]
        
        bjoli 5 hours ago ago
        
        It is restricted in a way that you would restrict yourself to write high speed software in most languages, and I found it is not that restrictive compared to C that you would have to use if you were to write a fast Python library.
        
        [-]
        
        Doxin 4 hours ago ago
        
        oh for sure, but I still feel like telling people pypy is written in python is misleading. it's written in something significantly like python, but it's not python.
        
        mjmas 3 hours ago ago
        
        > technically a python subset
        So it can just run under CPython? If so, then that isn't too misleading.
        
        [-]
        
        bjoli 2 hours ago ago
        
        Yes. It can run under Cpython (2.7).
  - actionfromafar 4 hours ago ago
    
    Well, one could rewrite Python (perhaps piece by piece?) in Shedskin.
    Shedskin is very nearly Python compatible, one could say it is an implementation of Python.
gield an hour ago ago

(2012)
woadwarrior01 5 hours ago ago

aka A Metacircular Interpreter

[-]
- mapontosevenths an hour ago ago
  
  Do you think God stays in heaven because he too lives in fear of what he's created?
andltsemi3 4 hours ago ago

"Yaw dog I heard you liked python, so I put python in your python so you can interpret python while you interpret python"
hcfman 6 hours ago ago

Just wondering why you stopped there? Why not a python interpreter for a python interpreter for python ?

[-]
- dnnddidiej 5 hours ago ago
  
  It already is that.