This is an old revision of the document!

nodeMCU Unofficial FAQ

* * * Work in Progress * * *

I have started a thread on the ESP8266 forum, Discussions on my nodeMCU Lua unofficial FAQ. Please use this to discuss any issues that you have with this FAQ; any areas where you feel that the explanation is unclear or needs further expansion; or any or Qs that you feel need answering and would help others if they were included here. Thank-you. Terry Ellison

What is this FAQ for?

This FAQ does not aim to help you to learn to program or even how to program in Lua. There are plenty of resources on the Internet for this, some of which are listed in Where to start . What this FAQ does is to answer some of the common questions that a competent Lua developer would ask in learning how to develop Lua applications for the ESP8266 based boards running the nodeMCU firmware.

Lua Language

Where to start

The nodeMCU firmware implements Lua 5.1 over the Espressif SDK for its ESP8266 SoC and the IoT modules based on this.

The official lua.org Lua Language specification gives a terse but complete language specification.
Its FAQ provides information on Lua availability and licensing issues.
The unofficial Lua FAQ provides a lot of useful Q and A content.
The Lua User's Wiki gives useful example source and relevant discussion. In particular, its Lua Learning Lua section is a good place to start learning Lua.
- The best book to learn Lua is Programming in Lua by Roberto Ierusalimschy, who is one of the creators of Lua. It's first edition is available free online . The second edition was aimed at Lua 5.1, but is out of print. The third edition is still in print and available in paperback. It contains a lot more material and clearly identifies Lua 5.1 vs Lua 5.2 differences. This third is widely available for purchase and probably the best value for money.
- The Espressif ESP8266 architecture is closed source, but the Espressif SDK itself is continually being updated so the best way to get the documentation for this is to google Espressif IoT SDK Programming Guide or to look at the Espressif downloads forum .
- The nodeMCU documentation is available online. However, please remember that the development team are based in China, and English is a second language, so the documentation needs expanding and be could improved with technical proofing.
- As with all Open Source projects the source for the nodeMCU firmware is openly available on the GitHub nodemcu-firmware repository.
How is nodeMCU Lua different to standard Lua?

nodeMCU Lua in an implementation of eLua over the ESP8266 SDK. eLua is a full-featured implementation of Lua 5.1 that is optimized for embedded system development and execution to provide a scripting framework that can be used to deliver useful applications within the limited RAM and Flash memory resources of embedded processors such as the ESP8266.

A key goal of eLua is to reduce the RAM requirements for the Lua runtime system. One of the key techniques used in this implementation is to use read-only tables and constants wherever practical for library modules. On a typical build this approach reduces the RAM footprint by some 20-25KB and this makes a Lua implementation for the ESP8266 feasible. This technique is called LTR and this is documented in detail in an eLua technical paper: Lua Tiny RAM .

The Espressif SDK is the interface that is freely available albeit in closed format to developers building applications for the ESP8266. The nodeMCU eLua implementation must therefore use this SDK as its kernel layer and work within any design constraints that the SDK API imposes. In particular, the SDK employs an event and task-oriented structure, where individual events can trigger an associated task; this task then runs to completion uninterrupted, at which point the next event queued can be initiated. (Note that the SDK contain device drivers which are interrupt driven. However, these are internal to the SDK, so it treats all event triggered application tasks as atomic.)

The API calls for each type of event typically use a callback parameter to bind a C function implementing a given task to a given event. In the case of the nodeMCU Lua implementation, this task is wrapper around a developer-provided Lua function. This event-driven model imposed by the SDK is very different to a conventional procedural implementation of Lua. Some standard Lua modules and eLua platform modules don't fit well within this structure, and so the nodeMCU implementation replaces these by ESP8266-specific versions. For example, the standard io and os libraries don't work, but have been largely replaced by the nodeMCU node and file libraries.

The debug and math libraries have also been omitted to reduce the runtime footprint.

ESP8266 Specifics

How is coding for the ESP8266 the same as standard Lua?

This is a fully featured Lua 5.1 implementation so all standard Lua language constructs and data types work.
The main standard Lua libraries – core, coroutine, string and table are implemented.

How is coding for the ESP8266 different to standard Lua?

The ESP8266 use onchip RAM and offchip Flash memory connected using a dedicated SPI interface. Both of these are very limited (when compared to systems than most application programmer use). The SDK and the Lua firmware already use the majority of this resource: the later build versions keep adding useful functionality, and unfortunately at an increased RAM and Flash cost, so depending on the build version and the number of modules installed the runtime can have as little as 17KB RAM and 40KB Flash available at an application level. This Flash memory is formatted an made available as a SPI Flash File System (SPIFFS) through the file library.
However, if you choose to use a custom build, for example one which uses integer arithmetic instead of floating point, and which omits libraries that aren't needed for your application, then this can help a lot doubling these available resources. (See Marcel Stör's excellent custom build tool that he discusses in this forum topic). Even so, those developers who are used to dealing in MB or GB of RAM and file systems can easily run out of these resources. Some of the techniques discussed below can go a long way to mitigate this issue.
Current versions of the ESP8266 run the SDK over the native hardware so there is no underlying operating system to capture errors and to provide graceful failure modes, so system or application errors can easily “PANIC” the system causing it to reboot. Error handling has been kept simple to save on the limited code space, and this exacerbates this tendency. Running out of a system resource such as RAM will invariably cause a messy failure and system reboot.
There is currently no debug library support. So you have to use 1980s-style “binary-chop” to locate errors and use print statement diagnostics though the systems uart interface. (This omission was largely because of the Flash memory footprint of this library, but there is no reason in principle why we couldn't make this library available in the near future as an custom build option).
The LTR implementation means that you can't easily extend standard libraries as you can in normal Lua, so for example an attempt to define function table.pack() will cause a runtime error because you can't write to the global table. (Yes, there are standard sand-boxing techniques to achieve the same effect by using metatable based inheritance, but if you try to use this type of approach within a real application, then you will find that you run out of RAM before you implement anything useful.)
- There are standard libraries to provide access to the various hardware options supported by the hardware: WiFi, GPIO, One-wire, I²C, SPI, ADC, PWM, UART, etc.
- The runtime system runs in interactive-mode. In this mode it first executes any init.lua script. It then “listens” to the serial port for input Lua chunks, and executes them once syntactically complete. There is no luac or batch support, although automated embedded processing is normally achieved by setting up the necessary event triggers in the init.lua script.
- The various libraries (net, tmr, wifi, etc.) use the SDK callback mechanism to bind Lua processing to individual events (for example a timer alarm firing). Developers should make full use of these events to keep Lua execution sequences short. If any individual task takes too long to execute then other queued tasks can time-out and bad things start to happen.
- Non-Lua processing (e.g. network functions) will usually only take place once the current Lua chunk has completed execution. So any network calls should be viewed at an asynchronous request. A common coding mistake is to assume that they are synchronous, that is if two socket:send() are on consecutive lines in a Lua programme, then the first has completed by the time the second is executed. This is wrong. Each socket:send() request simply queues the send operation for dispatch. Neither will start to process until the Lua code has return to is calling C function. Stacking up such requests in a single Lua task function burns scarce RAM and can trigger a PANIC. This true for timer, network, and other callbacks. It is even the case for actions such as requesting a system restart, as can be seen by the following example:
<code Lua> node.restart(); for i = 1, 20 do print(“not quite yet – ”,i); end </code>

So how does the SDK event / tasking system work in Lua?

Any SDK-based application for the ESP8266 uses a startup hook void userinit(void) defined by convention in the C module usermain.c. The system invokes this hook on boot. The user_init() function can by used to do any initialisation required and to call the necessary timer alarms or system functions to bind and callback routines to implement the tasks needed in response to any system events. Individual task callbacks need to implement their actions and return control to the SDK as soon as practical, as the SDK framework is not pre-emptive so any further event tasks are queued on a pending list within the SDK kernel.
Excessively long-running tasks can therefore cause other system functions and services to timeout, or allocate memory to buffer queued data, which can then trigger either the watchdog timer or memory exhaustion, both of which will ultimately cause the system to reboot.

SDK Callbacks include:

Timer alarm callbacks
Wifi scan callbacks
Network (ESPCONN) callbacks for connection, disconnect, send, receive, etc. (roughly equivalent the socket:on() callbacks in Lua)
GPIO and other hardware related interrupts.

The eLua implementation sits within this framework:

app/user/usermain.c contains the userinit() entry point. This reinitialises the UART, the volatile sections of flash memory (if necessary), the RomFS and SPIFFS before calling luamain() with the command-line lua -i. * The Lua RTS (see app/lua/lua.c) then sets up a timer to poll the input UART every 80 mSec to assemble a complete execution chunk which it then executes with a luapcall(). * The running Lua script can initialise one or more callbacks associated with events such as a timer. The module code will typically store the link to this Lua callback function in the Lua registry . When the callback hook is subsequently invoked, this hook code then retrieves this function reference from the registry and executes it with a luacall(). * There are no concurrency or interlock issues with this approach as the SDK will only initiate a callback after the previously running task has completed, and in the case of Lua when the previous Lua chunk has completed – Lua chunks are executed one-at-a-time. Consider an simple telnet example given in examples/fragment.lua: <code Lua> s=net.createServer(net.TCP) s:listen(23,function© constd = c

function soutput(str) if(constd~=nil) then

     con_std:send(str) 
   end 
 end 
 node.output(s_output, 0) 
 c:on("receive",function(c,l) node.input(l) end) 
 c:on("disconnection",function(c) 
   con_std = nil 
   node.output(nil) 
 end)

end) </code> This example doesn't use upvalues and all declarations are global, so we can reorder this code for clarity (though doing this adds a few extra globals):

 function c_receive(c,l) 
   node.input(l) 
 end
 function c_disconnection(c) 
   con_std = nil 
   node.output(nil) 
 end
 function s_output(str) 
   if(con_std~=nil) then 
     con_std:send(str) 
   end 
 end
 function s_listen(c) 
   con_std = c 
   node.output(s_output, 0) 
   c:on("receive",c_receive) 
   c:on("disconnection",c_disconnection) 
 end
 s=net.createServer(net.TCP) 
 s:listen(23,s_listen)

So let us consider how this is executed:

The main routine executes defining 4 functions in the global variables: creceive, cdisconnection, soutput, slisten; the server s is bound to port 23 registering slisten as the initialisation callback. The main routine then exits, with the global variables retained and the main routine code garbage collected. * When another computer connects to port 23, the listener handler retrieves the reference to slisten from the registry and calls it with the socket parameter. This function then binds soutput to the node.output hook registering it in the registry, and likewise the creceive and cdisconnection are bound and registered to the respective on handlers. We now have four routines registered in the registry associated with four events, and this routine then exits with only the routines execution frame garbage collected. * When a record is received, the onreceive handler retrieves the reference to creceive from the registry and calls it passing it the record. This routine then passes this to the node.input() and exits. (The node input handler marshals these records into a complete Lua chunk). * The node.input handler is polling on an 80 mSec alarm and if a compete Lua chunk is available, it executes it. Any output is then passed to the note.output handler which retrieves and calls soutput which exits on completion. Any pending sends are then processed. * This cycle repeats until the other computer disconnect which triggers the ondisconnect handler. This retrieves the cdisconnection reference from the registry and calls it. This routine dereferences the connected socket and closes the node.output hook and exits returning control to the disconnect handler which garbage collects any associated sockets and registered on handlers.

The SDK can and will often schedule other event tasks in between these Lua executions (e.g. to do the actual TCP stack processing). The longest individual Lua execution in this example is only 20 bytecode instructions (in the main routine). The original version was a few instructions shorter in that temporary locals were used to hold the closure references instead of globals, but the runtime and memory footprint aren't materially different.

Understanding how the system executes your code can help you structure it better and improve memory usage. Each event task is established by a callback in an API call in an earlier task.

So how is context passed between Lua event tasks?

It is important to understand that any event callback task is associated with a single Lua function. This function is executed from the relevant nodeMCU library C code using a luacall(). Even system initialisation which executes the dofile(“init.lua”) can be treated as a special case of this. Each function can invoke other functions and so on, but it must ultimate return control to the C library code. * By their very nature Lua local variables only exit within the context of an executing Lua function, and so all locals are destroyed between these luacall() actions. No locals are retained across events.
So context can only be passed between event routines by one of three mechanisms:
- Globals are by nature globally accessible. Any global will persist until explicitly dereference by reassigning nil to it. Globals can be readily enumerated by a for k,v in pairs(_G) do so their use is transparent.
- Upvalues are the Lua mechanism for implementing inheritance during a function closure. The Lua runtime system does behind-the-scenes magic to remap any upvalues into a hidden linked list when the calling stack frame is exited, so they just work as you would expect them to. The hidden values will be garbage collected correctly when fully dereferenced. However, some API calls don't correctly dereference expired callback references and as a result upvalues may not be correctly garbage collected and manifest themselves as memory leaks. So using them can cause more frequent and difficult to diagnose PANICs during testing. So my general recommendation is to stick to globals for this specific usecase of passing context between event callbacks, and nil them when done.
- The File system is a special case of persistent global, so there is no reason why it can't be used to pass context, in principle. However flash memory has a limited write cycle lifetime, so it is best to limit using file content that frequently changes to a mechanism of last resort.

So how is the Lua Registry used and why is this important?

So all Lua callbacks are called by C wrapper functions that are themselves callback activated by the SDK as a result of a given event. Such C wrapper functions themselves frequently need to store state for passing between calls or to other wrapper C functions. The Lua registry is simply another Lua table which is used for this purpose, except that it is hidden from direct Lua access. Any content that needs to be saved is created with a unique key. Using a standard Lua table enables standard garbage collection algorithms to operate on its content.

Note that we have identified a number of cases where library code does not correctly clean up Registry content when closing out an action, leading to memory leaks.

When and why should I avoid using tmr.delay()?

If you are used coding in a procedural paradigm then it is understandable that you consider using tmr.delay() to time sequence your application. However as discussed in the previous section, with nodeMCU Lua you are coding in an event-driven paradigm.
If you look at the app/modules/tmr.c code for this function, then you will see that it executes a low level etsdelayus(delay). This function isn't part of the nodeMCU code or the SDK; it's actually part of the xtensa-lx106 boot ROM, and is a simple timing loop which polls against the internal CPU clock. It does this with interrupts disabled, because if they are enabled then there is no guarantee that the delay will be as requested.

tmr.delay() should be correctly used if you want to have exact timing control on an external hardware I/O (e.g. lifting a GPIO pin high for 20 μSec). It will achieve no functional purpose in pretty much every other usecase, as any other system code-based activity will be blocked from execution; at worst it will break your the code and create hard-to-diagnose timeout errors. A good indication here is if you want a delay of more than 10 mSec or so, then using tmr.delay() is the wrong approach. You should be using a timer alarm or other library callback, to allow the other processing to take place. As the nodeMCU documentation correctly advises (translating Chinese English in to English): tmr.delay() will make the CPU work in non-interrupt mode, so other instructions and interrupts will be blocked. Take care in using this function.

How do I avoid a PANIC loop in init.lua?

Most of us have fallen into the trap of creating an init.lua that has a bug in it, which then causes the system to reboot and hence gets stuck in a reboot loop. If you haven't then you probably will do so at least once.

When this happens, the only robust solution is to reflash the firmware.
The simplest way to avoid having to do this is to keep the init.lua as simple as possible – say configure the wifi and then start your app on a one-shot tmr.alarm() after a 2-3 sec delay. This delay is long enough to issue a file.remove(“init.lua”) through the serial port and recover control that way.
Also always test any new init.lua by creating it as inittest.lua, say, and manually issuing a dofile(“inittest.lua”) through the serial port, and then only rename it when you are certain it is working as you require.

Techniques for Reducing RAM and SPIFFS footprint

How do I reduce the size of my compiled code?

Note that there are two methods of saving compiled Lua to SPIFFS:

The first is to use node.compile() on the .lua source file, which generates the equivalent bytecode .lc file. This approach strips out all the debug line and variable information.
The second is to use loadfile() to load the source file into memory, followed by string.dump() to convert it in-memory to a serialised load format which can then be written back to a .lc file. This approach creates a bytecode file which retains the debug information.

The memory footprint the bytecode created by method (2) is the same as when executing source files directly, but the footprint of bytecode created by method (1) is typically 60% of this size, because the debug information is almost as large as the code itself. So using .lc files generated by node.compile() considerably reduces code size in memory – albeit with the downside that any runtime errors are extremely limited.

In general consider method (1) if you have stable production code that you want to run in as low a RAM footprint as possible. Yes, method (2) can be used if you are still debugging, but you will probably be changing this code quite frequently, so it is easier to stick with .lua files for code that you are still developing.

Note that if you use require(“XXX”) to load your code then this will automatically search for XXX.lc then XXX.lua so you don't need to include the conditional logic to load the bytecode version if it exists, falling back to the source version otherwise.

How do I get a feel for how much memory my functions use?

Given the limited resources available to applications it is highly desirable that by you understand the VM model. The essential reference here is A No Frills Introduction to Lua 5.1 VM Instructions . This explain how the code generator works, how much memory overhead is involved with each table, function, string etc..
You can't easily get a bytecode listing of your ESP8266 code; however there are two broad options for doing this:
- Generate a bytecode listing on your development PC. The Lua 5.1 code generator is basically the same on the PC and on the ESP8266, so whilst it isn't identical, using the standard Lua batch compiler luac against your source on your PC with the -l -s option will give you a good idea of what your code will generate. The main difference between these two variants is the sizet for ESP8266 is 4 bytes rather than 8bytes found on modern 64bit development PCs; and the eLua variants generate different access references for ROM data types. If you want to see what the string.dump() version generates then drop the -s option to retain the debug information. * Upload your .lc files to the PC and disassemble then there. There are a number of Lua code disassemblers which can list off the compiled code that you application modules will generate, if you have a script to upload files from your ESP8266 to your development PC. I use ChunkySpy which can be downloaded here , but you will need to apply the following patch so that ChunkySpy understands eLua data types: <code diff> — a/ChunkSpy-0.9.8/5.1/ChunkSpy.lua 2015-05-04 12:39:01.267975498 +0100 +++ b/ChunkSpy-0.9.8/5.1/ChunkSpy.lua 2015-05-04 12:35:59.623983095 +0100 @@ -2193,6 +2193,9 @@ config.AUTODETECT = true elseif a == “–brief” then config.DISPLAYBRIEF = true + elseif a == “–elua” then + config.LUATNUMBER = 5
config.LUATSTRING = 6 elseif a == “–interact” then perform = ChunkSpyInteract </code>
- Your other great friend is to use node.heap() regularly through your code.
- Use these tools and play with coding approaches to see how many instructions each typical line of code takes in your coding style. The Lua Wiki gives some general optimisation tips, but in general just remember that these focus on optimising for execution speed and you will be interested mainly in optimising for code and variable space as these are what consumes precious RAM.

What is the cost of using functions?

Consider the output of dofile(“test1a.lua”) on the following code compared to the equivalent where the function pnh() is removed and the extra print(heap()) statement is placed inline:

-- test1b.lua
collectgarbage()
local heap = node.heap
print(heap())
local function pnh() print(heap()) end
pnh()
print(heap())

Heap Value	Function Call	Inline
1	20712	21064
2	20624	21024
3	20576	21024

Here bigger means less RAM used.
Of course you should still use functions to structure your code and encapsulate common repeated processing, but just bear in mind that each function definition has a relatively high overhead for its header record and stack frame (compared to the 20 odd KB RAM available). So try to avoid overusing functions. If there are less than a dozen or so lines in the function then you should consider putting this code inline if it makes sense to do so.

How do I minimise the footprint of an application on the file system

It is possible to write Lua code in a very compact format which is very dense in terms of functionality per KB of source code.
However if you do this then you will also find it extremely difficult to debug or maintain your application.
A good compromise is to use a tool such as LuaSrcDiet, which you can use to compact production code for downloading to the ESP8266:
- Keep a master repository of your code on your PC or a cloud-based versioning repository such as GitHub
- Lay it out and comment it for ease of maintenance and debugging
  - Use a package use as Esplorer to download modules that you are debugging and to test them.
  - Once the code is tested and stable, then compress it using LuaSrcDiet before downloading to the ESP8266. Doing this will reduce the code footprint on the SPIFFS by 2-3x.
  - Consider using node.compile() to pre-compile any production code. This removes the debug information from the compiled code reducing its size by roughly 40%. (However this is still perhaps 1.5-2x larger than a LuaSrcDiet-compressed source format, so if SPIFFS is tight then you might consider leaving less frequently run modules in Lua format. If you do a compilation, then you should consider removing the Lua source copy from file system as there's little point in keeping both on the ESP8266.
  - If you are developing applications which are multi-tiered, for example if your ESP8266s are logically connected to a RasberryPi server, then strip down the functionality on the ESP8266 to a minimum and use the RPi with its 2Gb RAM and full Linux OS as the vehicle for doing all end-user device communication and validation.
  How do I minimise the footprint of running application
The Lua Garbage collector is very aggressive at scanning and recovering dead resources. It use an incremental mark-and-sweep strategy which means that any data which is not ultimately referenced back to the Globals table, the Lua registry or in-scope local variables in the current Lua code will be collected.
Setting any variable to nil dereferences the previous context of that variable. (Note that reference-based variables such as tables, strings and functions can have multiple variables referencing the same object, but once the last reference has set to nil, the collector will recover the storage.
Unlike other compile-on-load languages such as PHP, Lua compiled code is tread the same way as any other variable type when it comes to garbage collection and can be collected when fully dereferenced, so that the code-space can be reused.
This strong dispose on dereference feature coupled with the fact that Lua execution is intrinsically divided into separate event tasks each associated with a Lua callback, means that it is very easy to structure your application using an classic technique which dates back to the 1950s know as Overlays.
There are various approaches to implementing this. One is described by DP Whittaker in his Massive memory optimization: flash functions topic. Another is to use volatile modules. There are standard Lua templates for creating modules, but the require() functions creates a reference for the loaded module in the package.loaded table, and this reference prevents the module being garbage collected. To make a module volatile, you should remove this reference by setting it to nil. You can't do this in the outermost level of the module (since the reference is only created once execution has returned from the module code), but you can in any module function, and typically an initialisation function for the module, as in the following example: <code Lua>
1. - . . .
local s=net.createServer(net.TCP) s:listen(80,function© (require(“connector”)).init© end) </code>
connector.lua would be a standard module pattern except that the M.init() routine must include the lines <code Lua> local M, module = {}, …
1. - . . .
function M.init(csocket) package.loaded[module]=nil
1. - . . .
end
1. - . . .
return M </code>
This approach ensures that the module can be fully dereferenced on completion. OK, in this case, it also means that it has to be reloaded on each TCP connection to port 80, but loading a compiled module from SPIFFS only takes a few mSec, so surely this is an acceptable overhead if it enables you to break down your application into RAM-sized chunks. Recall that require() will automatically search for connector.lc followed by connector.lua, so the code will work for both source and compiled variants.

What other resources are available

Install lua and luac on your development PC. This is freely available in Windows, Mac and Linux distributions, but we strongly suggest that you use Lua 5.1 to maintain source compatibility with ESP8266 code. This will allow you not only to unit test some modules on you PC in a rich development environment, but you can also use luac to list off bytecode listing of your code and syntactically validate new code before downloading to the ESP8266. This will also allow you to develop server-side applications and embedded applications in a common language.

User Tools

Site Tools

**This is an old revision of the document!**

Table of Contents