Improve SPI interface (GIT8266O-397) #723

zub2 · 2019-10-07T20:07:28Z

Environment

Development Kit: esp8266ex
IDF version: d6ec931
Development Env: Make
Operating System: Linux
Power Supply: external 3.3V

Problem Description

There are several issues that make the SPI driver difficult to use:

`spi_trans` (drivers/spi.c) requires 32bit-aligned buffers for miso and mosi data

This is not a big deal when the buffer is allocated for SPI, but when a longer buffer is sent piecewise, the necessary alignment becomes a burden. Consider the following scenario (SPI master):

uint16_t cmd = SOME_CMD;
uint32_t addr = SOME_ADDR;

spi_trans_t t = {};
t.cmd = &cmd;
t.bits.cmd = 16;
t.addr = &addr;
t.bits.addr = 32;

unsigned size;
uint8_t * data = getData(&size);
while (size > 0)
{
    waitForDeviceReady();
    const unsigned transferSize = min(size, getDeviceFreeBufferSize());

    t.mosi = (uint32_t*)data;
    t.bits.mosi = transferSize * 8;
    spi_trans(HSPI_HOST, t);

    data += transferSize;
    size -= transferSize;
}

... unless each time a multiple of 4 bytes is transferred, this explodes due to wrong alignment in spi_master_trans(). So the only solution currently is for user code to keep copying the data to a properly aligned buffer before it's passed to spi_trans(). So it's 1 copy before spi_trans() is called and another copy of the same data inside spi_master_trans().

The same issue happens with miso data.

Furthermore, making the type uint32_t* makes life difficult in all situations when the user doesn't want to transfer an array of uint32_ts. An uint8_t* or just void* would be more obvious. There is no data structure on this level - it's just bytes, or in fact just bits, as bits.mosi is actually the number of bits to transfer. So why force multiples of 4 bytes while the actual transfer size is bits?

I see that the actual SPI data buffer is defined as uint32_t (in spi_struct.h). I was not successful in finding any details on how should the HW buffer be accessed. But looking at how esp-open-rtos handles this (esp_spi.c in esp-open-rtos), it seems that a simple memcpy() (from a source address w/o any necessary alignment) is sufficient. It just seems that the last write has to make sure it ends on a multiple of 4 bytes. Alternatively, even if the SPI data buffer had to be accessed as uint32_t, it could still be handled inside spi_master_trans().

The size of `spi_trans_t` is 20 bytes yet `spi_trans()` accepts it by value

This means it's copied to stack. And then spi_trans() calls spi_trans_static() which calls one of spi_master_trans() or spi_slave_trans(). If the calls are not inlined, that's 3 unneeded copies of spi_trans_t. I think spi_trans(), spi_trans_static(), spi_master_trans() and spi_slave_trans() should accept a pointer to the spi_trans_t to spare stack space.

Making `spi_trans_t` `cmd` and `addr` pointers makes the usage for SPI master more difficult

When cmd or addr are an expression, a local variable has to be created. I understand why these were made pointers - for SPI slave. But why not just store the received values in the SPI slave scenario inside the spi_trans_t and let the caller then take the values and process them as needed? Of course this would only work if the spi_trans_t is passed via pointer, so that spi_slave_trans() can modify these.

The text was updated successfully, but these errors were encountered:

zub2 · 2019-10-07T20:12:40Z

For the alignment issue, I realized I am not using addr. So a work around for me is putting the 1 to 3 bytes into addr and using the rounded-up pointer as mosi. But this won't work when one also uses addr.

donghengqaz · 2019-10-12T03:19:58Z

I haved noticed problems what you meet, so we are refactoring the SPI driver, main work is about what your mentioned. And the other work is to increase the throughput of the SPI. So when version v3.3 is released, we may solve these problems..

zub2 · 2020-02-09T20:23:01Z

I see you made some changes in v3.3-rc1. Specifically I see the following:

spi_trans_t is now passed via pointer - great, thanks 👍
spi_trans_t still defines mosi and miso as uint32_t* but a comment implying 32-bit alignment is not strictly needed was added, and in code I see that when unaligned buffer is passed to spi_master_trans the following happens:

ESP_LOGW(TAG,"Using unaligned data may reduce transmission efficiency");
memset(spi_object[host]->buf, 0, sizeof(uint32_t) * 16);
memcpy(spi_object[host]->buf, trans->mosi, trans->bits.mosi / 8 + (trans->bits.mosi % 8) ? 1 : 0);
for (x = 0; x < trans->bits.mosi; x += 32) {
    y = x / 32;
    SPI[host]->data_buf[y] = spi_object[host]->buf[y];
}

... does it mean the SPI[host]->data_buf can't be accessed in smaller units than uint32_t? I.e. would memcpy((uint8_t*)SPI[host]->data_buf, trans->mosi, (trans->bits.mosi+7) / 8) not be possible? I'm asking especially as esp-open-rtos does not seem to have a problem with just a memcpy (see here) and having used the code, I can confirm it worked well. Although it is possible that it was still outside the spec for the ESP8266 SPI.
Also, the current code for unaligned handling is in fact quite sub-optimal, even if a simple memcpy into SPI[host]->data_buf can't be used: First it does the warning trace. I'd expect this to already be a performance hit in itself. Then it does a memset to fill spi_object[host]->buf with zeros. And then it does a memcpy of the user data to the same spi_object[host]->buf - so why the memset then? And then it does the loop copy.

vargavik · 2020-12-14T22:16:54Z

I'd like to point out an operator precedence problem with the trans->bits.mosi / 8 + (trans->bits.mosi % 8) ? 1 : 0 part too.
That should be trans->bits.mosi / 8 + ((trans->bits.mosi % 8) ? 1 : 0)

The current code will copy only exactly 1 byte no matter what is the content of trans->bits.mosi.
I had a non working code and had to drill down to logic analizer level to recognize only the first byte was correctly sent ot by the hardware. The same issue exist in the miso part.

vargavik · 2020-12-15T20:01:59Z

I found another bit more serious issue at the miso part.
If the data is aligned but the miso transmission byte length cannot be divided by 4 the following line will cause memory corruption for last incomplete 4 byte chunk of trans->mosi array:
trans->miso[y] = SPI[host]->data_buf[y];

The alignment check should also check the transfer length like this:
if ((uint32_t)(trans->miso) % 4 == 0 && trans->bits.miso % 32 == 0) {
for (x = 0; x < trans->bits.miso; x += 32) {
y = x / 32;
trans->miso[y] = SPI[host]->data_buf[y];
}
} else {

…e SPI interface (GIT8266O-397)"

github-actions bot changed the title ~~Improve SPI interface~~ Improve SPI interface (GIT8266O-397) Feb 9, 2020

vargavik mentioned this issue Dec 14, 2020

SPI transmit data on HSPI_HOST is wrong (GIT8266O-355) #791

Open

vargavik mentioned this issue Aug 23, 2021

Fix for #723 Bugfix/fix spi 4 byte alignment error (GIT8266O-397) (GIT8266O-704) #1126

Closed

vargavik pushed a commit to vargavik/ESP8266_RTOS_SDK that referenced this issue Aug 27, 2021

Fix for espressif#723, SPI communication problem described in "Improv…

af203f3

…e SPI interface (GIT8266O-397)"

vargavik added a commit to vargavik/ESP8266_RTOS_SDK that referenced this issue Aug 27, 2021

Fix for espressif#723, SPI communication problem described in "Improv…

09cddc7

…e SPI interface (GIT8266O-397)"

vargavik added a commit to vargavik/ESP8266_RTOS_SDK that referenced this issue Aug 27, 2021

Fix for espressif#723, SPI communication problem described in "Improv…

6d515c0

…e SPI interface (GIT8266O-397)"

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve SPI interface (GIT8266O-397) #723

Improve SPI interface (GIT8266O-397) #723

zub2 commented Oct 7, 2019 •

edited

Loading

zub2 commented Oct 7, 2019 •

edited

Loading

donghengqaz commented Oct 12, 2019

zub2 commented Feb 9, 2020

vargavik commented Dec 14, 2020

vargavik commented Dec 15, 2020

Improve SPI interface (GIT8266O-397) #723

Improve SPI interface (GIT8266O-397) #723

Comments

zub2 commented Oct 7, 2019 • edited Loading

Environment

Problem Description

spi_trans (drivers/spi.c) requires 32bit-aligned buffers for miso and mosi data

The size of spi_trans_t is 20 bytes yet spi_trans() accepts it by value

Making spi_trans_t cmd and addr pointers makes the usage for SPI master more difficult

zub2 commented Oct 7, 2019 • edited Loading

donghengqaz commented Oct 12, 2019

zub2 commented Feb 9, 2020

vargavik commented Dec 14, 2020

vargavik commented Dec 15, 2020

zub2 commented Oct 7, 2019 •

edited

Loading

`spi_trans` (drivers/spi.c) requires 32bit-aligned buffers for miso and mosi data

The size of `spi_trans_t` is 20 bytes yet `spi_trans()` accepts it by value

Making `spi_trans_t` `cmd` and `addr` pointers makes the usage for SPI master more difficult

zub2 commented Oct 7, 2019 •

edited

Loading